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COMBINATION APPROACHES FOR GENERATING IMMUNE RESPONSES 
AGAINST MULTIPLE STRAINS SELECTED FROM A GIVEN SUBTYPE OR 
SEROTYPE 

5 

Technical Field 

The present invention relates to compositions comprising a polynucleotide 
component and a polypeptide component that can be used for the generation of 
immune responses in a subject. In one aspect, the compositions of the present 
10 invention are used in methods to generate immune responses in subjects to which the 
compositions are administered. In another aspect, the compositions of the present 
invention are used in methods of generating neutralizing activity against multiple 
strains derived from a single subtype or serotype of a selected microorganims , for 
example, viruses (e.g., Human Immunodeficiency Virus (HIV)). 

15 

Background of the Invention 

Acquired immune deficiency syndrome (AIDS) is recognized as one of the 
greatest health threats facing modern medicine. There is, as yet, no cure for this 
disease. 

20 In 1983-1984, three groups independently identified the suspected etiological 

agent of AIDS. See, e.g., Barre-Sinoussi et al. (1983) Science 220:868-871; 
Montagnier et al., in Human T-Cell Leukemia Viruses (Gallo, Essex & Gross, eds., 
1984); Vilmer et al. (1984) The Lancet 1:753; Popovic et al. (1984) Science 224:497- 
500; Levy et al. (1984) Science 225:840-842. These isolates were variously called 

25 lymphadenopathy-associated virus (LAV), human T-cell lymphotropic virus type III 
(HTLV-III), or ATDS-associated retrovirus (ARV). All of these isolates are strains of 
the same virus, and were later collectively named Human Immunodeficiency Virus 
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(HIV). With the isolation of a related AIDS-causing virus, the strains originally called 
HIV are now termed HIV-1 and the related virus is called HIV-2 See, e.g., Guyader et 
al. (1987) Nature 326:662-669; Brun-Vezinet et al. (1986) Science 233:343-346; 
Clavel et al. (1986) Nature 324:691-695. 
5 A great deal of information has been gathered about the HIV virus; however, 

to date an effective vaccine has not been identified. Several targets for vaccine 
development have been examined including the env and Gag gene products encoded 
by HIV. Gag gene products include, but are not limited to, Gag-polymerase and Gag- 
protease. Env gene products include, but are not limited to, monomelic gpl20 

10 polypeptides, oligomeric gpl40 polypeptides and gp 160 polypeptides. 

Haas, et al., (Current Biology 6(3):315-324, 1996) suggested that selective 
codon usage by HIV-1 appeared to account for a substantial fraction of the 
inefficiency of viral protein synthesis. Andre, et al., (J. Virol. 72(2):1497-1503, 1998) 
described an increased immune response elicited by DNA vaccination employing a 

15 synthetic gpl20 sequence with modified codon usage. Schneider, et al., (J Virol. 
71(7):4892-4903, 1997) discuss inactivation of inhibitory (or instability) elements 
(INS) located within the coding sequences of the Gag and Gag-protease coding 
sequences. 

The Gag proteins of HIV-1 are necessary for the assembly of virus-like 
20 particles. HIV-1 Gag proteins are involved in many stages of the life cycle of the virus 
including, assembly, virion maturation after particle release, and early post-entry steps 
in virus replication. The roles of HIV-1 Gag proteins are numerous and complex 
(Freed, E.O., Virology 251:1-15, 1998). 

Wolf, et al., (PCT International Publication No. WO 96/30523, published 3 
25 October 1996; European Patent Application, Publication No. 0 449 1 16 Al, published 
2 October 1991) have described the use of altered pr55 Gag of HIV-1 to act as a non- 
infectious retroviral-like particulate carrier, in particular, for the presentation of 
immunologically important epitopes. Wang, et al., (Virology 200:524-534, 1994) 
describe a system to study assembly of HIV Gag-beta-galactosidase fusion proteins 
30 into virions. They describe the construction of sequences encoding HIV Gag-beta- 
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galactosidase fusion proteins, the expression of such sequences in the presence of HIV 

Gag proteins, and assembly of these proteins into virus particles. 

Shiver, et al., (PCT International Publication No. WO 98/34640, published 13 

August 1998) described altering HIV-1 (CAM1) Gag coding sequences to produce 
5 synthetic DNA molecules encoding HIV Gag and modifications of HIV Gag. The 

codons of the synthetic molecules were codons preferred by a projected host cell. 

Recently, use of HIV Env polypeptides in immunogenic compositions has been 

described, (see, U.S. Patent No. 5,846,546 to Hurwitz et al., issued December 8, 1998, 

describing immunogenic compositions comprising a mixture of at least four different 
10 recombinant virus that each express a different HIV env variant; and U.S. Patent No. 

5,840,313 to Vahlne et al., issued November 24, 1998, describing peptides which 

correspond to epitopes of the HIV-1 gpl20 protein). In addition, U.S. Patent No. 

5,876,731 to Sia et al, issued March 2, 1999 describes candidate vaccines against HIV 

comprising an amino acid sequence of a T-cell epitope of Gag linked directly to an 
15 amino acid sequence of a B-cell epitope of the V3 loop protein of an HIV-1 isolate 

containing the sequence GPGR. 

PCT International Publication Nos. WO/00/39302; WO/00/39303; 

WO/00/39304; WO/02/04493; WO/03/004657; WO/03/004620; and WO/03/020876 

described a number of codon-optimized HIV polypeptides, as well as some native HIV 
20 sequences. Further, a variety of HIV polypeptides comprising mutations were 

described. The use of these HIV polypeptides in vaccine compositions and methods 

of immunization were also described. 

The present invention provides improved compositions and methods for 

generating immune responses against multiple subtypes, serotypes, or strains of a 
25 selected microorganism , for example, a virus (e.g., HIV-1). 

Summary of the Invention 

The present invention relates to compositions and methods for their use for 
generating an immune response in a subject. The compositions of the invention 
30 comprise at least two components wherein each component provides a different but 
analogous polypeptide immunogen. The polypeptide immunogen is provided either 
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directly in the form of a polypeptide (including polypeptide fragments, modified 
forms, encapsulated forms, etc.) or indirectly as a polynucleotide immunogen 
(including DNA and/or RNA encoding a polypeptide immunogen). The compositions 
of the present invention may be used in methods to generate immune responses in 
5 subjects to which the compositions are administered, wherein the immune response is 
directed against multiple subtypes, serotypes, or strains of a selected microorganims , 
for example, viruses (e.g., Human Immunodeficiency Virus (HIV)). In a preferred 
embodiment, the present invention relates to compositions comprising a 
polynucleotide component and a polypeptide component that can be used for the 

10 generation of immune responses in a subject, for example, the generation of 

neutralizing antibodies. Other embodiments comprising at least two polynucleotide 
components each providing a different but analogous polypeptide immunogen, or 
embodiments comprising at least two polypeptide components each providing a 
different but analogous polypeptide immunogen are also contemplated. The 

15 compositions of the present invention may be used in methods to generate immune 
responses in subjects to which the compositions are administered, wherein the immune 
response is directed against multiple strains of a first subtype or serotype of a selected 
microorganims , for example, viruses (e.g., Human Immunodeficiency Virus (HIV)). 
In a first aspect, the present invention includes a composition for generating an 

20 immune response in a mammal. These compositions typically comprise 

a polynucleotide component consisting essentially of one polynucleotide 
encoding an HIV immunogenic polypeptide derived from a first HIV strain of a first 
subtype, and 

a polypeptide component comprising one or more HIV immunogenic 
25 polypeptides analogous to the polypeptide encoded by said polynucleotide component, 
with the proviso that at least one HIV immunogenic polypeptide of the polypeptide 
component is derived from a second HIV strain of the first subtype, wherein said first 
HIV strain and said second HIV strain are different. A further embodiment of this 
composition includes the provisos that (i) the polynucleotide component does not 
30 encode an analogous HIV immunogenic polypeptide derived from any subtype other 
than the first subtype, and (ii) the polypeptide component does not comprise an 
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analogous HIV immunogenic polypeptide derived from any subtype other than the 
first subtype. 

In a second aspect, the present invention includes compositions for generating 
an immune response in a mammal. These compositions typically comprise a 
5 polynucleotide component comprising two or more polynucleotide sequences 
comprising coding sequences for two or more analogous HIV immunogenic 
polypeptides derived from a first HIV subtype, wherein the coding sequences for at 
least two of the HIV immunogenic polypeptides are derived from different HIV strains 
of the first subtype, and a polypeptide component that comprises one or more HIV 

10 immunogenic polypeptides analogous to the polypeptide encoded by the 

polynucleotide component, with the proviso that (i) if the polypeptide component 
comprises less than the number of analogous HIV immunogenic polypeptides encoded 
by the polynucleotide component, then the HIV immunogenic polypeptides of the 
polypeptide composition may be derived from the same and/or different HIV strains of 

15 the first subtype, or (ii) if the polypeptide component comprises the same or greater 
than the number of analogous HIV immunogenic polypeptides encoded by the 
polynucleotide component, then at least one of the HIV immunogenic polypeptides of 
the polypeptide composition is derived from a different HIV strain of the first subtype; 
with the provisos that (i) the polynucleotide component does not encode an HIV 

20 immunogenic polypeptide derived from any subtype other than the first subtype, and 
(ii) the polypeptide component does not comprise an HIV immunogenic polypeptide 
derived from any subtype other than the first subtype. 

The polynucleotide components of both of these aspects may comprises at 
least one polynucleotide that is a native polynucleotide. Alternately, or in addition, 

25 the polynucleotide components may comprise at least one polynucleotide that is a 

synthetic polynucleotide. Synthetic polynucleotides may comprise codons optimized 
for expression in mammalian cells (e.g., human cells). The polynucleotide component 
may comprise a single polynucleotide molecule, or two or more different 
polynucleotide molecules, each encoding one or more HIV polypeptides. The 

30 polynucleotide component may comprise DNA or RNA or both. 
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The HIV immunogenic polypeptides (encoded by the polynucleotide 
component and/or those which comprise the polypeptide component) may be HIV 
envelope polypeptides. The HIV polypeptides may comprises one or more mutations 
compared to the wild-type (i.e., naturally-occuring) HIV polypeptide (e.g., in the case 
5 of envelope proteins, at least one of the envelope polypeptides may comprise a 
mutation in the cleavage site or a mutation in the glycosylation site, a deletion or 
modification of the VI region, a deletion or modification of the V2 region, a deletion 
or modification of the V3 region, modifications to expose an envelope binding region 
that binds to a CCR5 chemokine co-receptor, and combinations thereof). Other 

10 immunogenic HIV polypeptides may include, but are not limited to, Gag, Env, Pol, 
Prot, Int, RT, vif, vpr, vpu, tat, rev, and nef polypeptides. 

The first subtype from which the HIV immunogenic polypeptides and coding 
sequences therefore may be selected includes, but are not limited to, the 
following:subtypeA,subtypeB,subtypeC,subtypeD,subtypeE,subtypeF,subtypeG, 

1 5 andsubtypeO, as well as any of the identified CRFs. 

In addition to immunogenic HIV polypeptides and sequences encoding same, 
the polynucleotide component may encode and the polypeptide component may 
comprise one or more additional antigenic polypeptides which may include antigenic 
polypeptides not derived from HIV-1 coding sequences. 

20 The polynucleotide component may further comprise sequences encoding one 

or more control elements compatible with expression in a selected host cell, wherein 
the control elements are operable linked to polynucleotides encoding HIV 
immunogenic polypeptides. Exemplary control elements include, but are not limited 
to, a transcription promoter (e.g., CMV, CMV+intron A, SV40, RSV, HIV-Ltr, 

25 MMLV-ltr, and metallothionein), a transcription enhancer element, a transcription 

termination signal, polyadenylation sequences, sequences for optimization of initiation 
of translation, internal ribosome entry sites, and translation termination sequences. 

The polynucleotide component may comprise further components as described 
herein (e.g., carriers, vector sequences, control sequences, etc.). The polypeptide 

30 component may comprise further components as described herein (e.g., carriers, 
adjuvants, immunoenhancers, etc.). 
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The present invention also includes methods of generating an immune 
response in a subject. In one embodiment of the method, a composition for generating 
an immune response in a mammal of the present invention, for example, as described 
above, is provided. One or more gene delivery vectors comprising the polynucleotides 
5 of the polynucleotide component of the composition are administered to the subject 
under conditions that are compatible with expression of the polynucleotides in the 
subject for the production of encoded HIV immunogenic polypeptides. Further, the 
polypeptide component of the composition for generating an immune response is 
administered to the subject. 

10 The one or more gene delivery vectors and the polypeptide component may be 

administered, for example, concurrently or sequentially. 

The polynucleotide component may comprise further components as described 
herein (e.g., carriers, vector sequences, control sequences, etc.). The polypeptide 
component may comprise further components as described herein (e.g., carriers, 

1 5 adjuvants, immunoenhancers, etc.). 

The one or more gene delivery vectors may comprise, for example, nonviral 
and/or viral vectors. Exemplary non-viral vectors include, but are not limited to 
plasmids or expression cassettes. Exemplary viral vectors include, but are not limited 
to retroviral, lentiviral, alphaviral, poxviral, herpes viral, adeno-associated viral, 

20 polioviral, measles viral, adenoviral vectors, or other known viral vectors. The one or 
more gene delivery vectors may be delivered using a particulate carrier, for example, 
coated on a gold or tungsten particle and the coated particle may be delivered to the 
subject using a gene gun, or PLG particles delivered by electroporation or otherwise. 
Alternatively, the one or more gene delivery vectors are encapsulated in a liposome 

25 preparation. The one or more gene delivery vectors may be administered, for 

example, intramuscularly, intramucosally, intranasally, subcutaneously, intradermally, 
transdermally, intravaginally, intrarectally, orally, intravenously, or by combinations 
of these methods. 

The subjects of the methods of the present invention are typically mammals, 
30 for example, humans. 
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The immune response generated by the methods of the present invention may 
be humoral and/or cellular. In one embodiment, the immune response results in 
generating neutralizing antibodies in the subject against multiple strains derived from 
the first HIV subtype. 

5 These and other embodiments of the present invention will readily occur to 

those of ordinary skill in the art in view of the disclosure herein. 

Brief Description of the Figures 

Figures 1 A to ID depict the nucleotide sequence of HlVsubtypeC 
10 8_5_TV1_C.ZA (SEQ ID NO: 1 ; referred to herein as TV1). Various regions are 
shown in Table 1 . 

Figures 2A-2E depicts an alignment of Env polypeptides from various HIV 
isolates (Type B-SF162, subtype C-TV1.8_2, subtype C-TV1.8_5, subtype C-TV2.12- 
5/1, subtype C-MJ4, India subtype C-93IN101, subtype A-Q2317, subtype D- 
15 92UG001, subtype E-cm235, and a Consensus Sequence). The arrows indicate 

exemplary regions for deletions and/or truncations in the beta and/or bridging sheet 
region(s). The "*" denotes N-linked glycosylation sites, one or more of which can be 
modified (e.g., deleted and/or mutated; one such possible mutation is mutation (N-> 

Q)). 

20 Figure 3 presents a schematic diagram showing the relationships between the 

following forms of the HrV Env polypeptide: gpl60, gpl40, gpl20, and gp41. 

Figure 4 presents neutralizing antibody activity data against HIV-1 subtype B 
strain SF162 obtained from a number of different immunization protocols in rabbits. 
Figure 5 presents neutralizing antibody activity data against HrV-1 subtype C 
25 strain TV1 obtained from a number of different immunization protocols in rabbits. 

Figure 6 presents the nucleotide sequence of the polynucleotide designated 
gpl40.modSF162.delV2. 

Figure 7 presents the nucleotide sequence of the polynucleotide designated 
gpl40.mut7.modSF162.delV2. 
30 Figure 8 presents the nucleotide sequence of the polynucleotide designated 

gpl40mod.TVl.delV2. 

8 



PP2 1100.001 
PATENT 

Figure 9 presents the nucleotide sequence of the polynucleotide designated 
gpl40mod.TVl.mut7.delV2. 

Figure 10 presents the nucleotide sequence of the polynucleotide designated 
gpl60mod.Q23-17 (optimized sequence based on subtype A HrV-1 isolate Q23-17 
5 from Kenya GenBank Accession AF004885). 

Figure 1 1 presents the nucleotide sequence of the polynucleotide designated 
gpl60mod.98UA01 16 (optimized sequence based on subtype A HIV-1 isolate 
98UA01 16 from Ukraine GenBank Accession AF4 13987). 

Figure 12 presents the nucleotide sequence of the polynucleotide designated 
10 gpl60mod.SE8538 (optimized sequence based on subtype A HIV-1 isolate SE8538 
from Tanzania GenBank Accession AF069669). 

Figure 13 presents the nucleotide sequence of the polynucleotide designated 
gpl60mod.UG031 (optimized sequence based on subtype A Human 
immunodeficiency virus 1 proviral DNA, complete genome, clone:pUG031-Al 
1 5 GenBank Accession AB098330). 

Figure 14 presents the nucleotide sequence of the polynucleotide designated 
gpl60mod.92UG001 (optimized sequence based on subtype D Human 
immunodeficiency virus type 1 complete proviral genome, strain 92UG001 GenBank 
Accession AJ320484). 
20 Figure 15 presents the nucleotide sequence of the polynucleotide designated 

gpl60mod.94UGl 14 (optimized sequence based on subtype D HrV-1 isolate 
94UG1 14 from Uganda GenBank Accession U88824). 

Figure 16 presents the nucleotide sequence of the polynucleotide designated 
gpl60mod.ELI (optimized sequence based on subtype D Human immunodeficiency 
25 virus type 1, isolate ELIGenBank Accession K03454). 

Figure 17 presents the nucleotide sequence of the polynucleotide designated 
gpl60mod.93IN101 (optimized sequence based on Indian subtype C Human 
immunodeficiency virus type 1 subtype C genomic RNA GenBank Accession 
AB023804). 

30 Figure 1 8 presents the nucleotide sequence of the polynucleotide designated 

gpl60mod.cm235.V3con (optimized sequence based on subtype E HIV-1 isolate). 
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Figure 19 presents the nucleotide sequence of the polynucleotide designated 
gpl60partialmod.cm235.V3 con (optimized sequence based on subtype E HIV-1 
isolate). 

Figure 20 present the ELISA data for binding antibody titers for SF162 Env 
5 protein in immunized chimpanzees. 

Figure 21 presents lymphoproliferative data from chimpanzees immunized 
with HIV mn env DNA (as a prime) and HIV sfi62 env protein (as boost). 

Detailed Description of the Invention 

10 The practice of the present invention will employ, unless otherwise indicated, 

conventional methods of chemistry, biochemistry, molecular biology, immunology 
and pharmacology, within the skill of the art. Such techniques are explained fully in 
the literature. See, e.g., Remington 's Pharmaceutical Sciences, 18th Edition (Easton, 
Pennsylvania: Mack Publishing Company, 1990); Methods In Enzymology (S. 

1 5 Colowick and N. Kaplan, eds., Academic Press, Inc.); and Handbook of Experimental 
Immunology, Vols. I-IV (D.M. Weir and C.C. Blackwell, eds., 1986, Blackwell 
Scientific Publications); Sambrook, et al., Molecular Cloning: A Laboratory Manual 
(2nd Edition, 1989); Short Protocols in Molecular Biology, 4th ed. (Ausubel et al. 
eds., 1999, John Wiley & Sons); Molecular Biology Techniques: An Intensive 

20 Laboratory Course, (Ream et al., eds., 1998, Academic Press); PCR (Introduction to 
Biotechniques Series), 2nd ed. (Newton & Graham eds., 1997, Springer Verlag). 

All patents, publications, sequence citations, and patent applications cited in 
this specification are herein incorporated by reference as if each individual patent, 
publication, sequence citation, or patent application was specifically and individually 

25 indicated to be incorporated by reference in its entirety for all purposes. 

As used in this specification, the singular forms "a," "an" and "the" include 
plural references unless the content clearly dictates otherwise. Thus, for example, 
reference to "an antigen" includes a mixture of two or more such agents. 
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1.0.0 Definitions 

In describing the present invention, the following terms will be employed, and 
are intended to be defined as indicated below. 

"Synthetic" sequences, as used herein, refers to HIV polypeptide-encoding 
5 polynucleotides whose expression has been modified as described herein, for example, 
by codon substitution, altered activities, and/or inactivation of inhibitory sequences. 
"Wild-type" or "native" sequences, as used herein, refer to polypeptide-encoding 
polynucleotides that are substantially as they are found in nature, e.g., Gag, Pol, Vif, 
Vpr, Tat, Rev, Vpu, Env and/or Nef encoding sequences as found in HIV isolates, e.g., 

10 SF162, SF2, AF1 10965, AF1 10967, AF1 10968, API 10975, MJ4 (a subtype C, 

Ndung'u et al. (2001) J. Virol. 75:4964-4972), subtype B-SF162, subtype C-TV1.8_2 
(8_2_TV1_C.ZA), subtype C-TV1.8_5 (8_5_TV1_C.ZA), subtype C-TV2.12-5/1 (12- 
5_1_TV2_C.ZA), subtype C-MJ4, India subtype C-93IN101, subtype A-Q2317, 
subtype D-92UG001, subtype E-cm235, subtype A HIV-1 isolate Q23-17 from Kenya 

1 5 GenBank Accession AF004885, subtype A HIV-1 isolate 98UA01 1 6 from Ukraine 
GenBank Accession AF413987, subtype A HIV-1 isolate SE8538 from Tanzania 
GenBank Accession AF069669, subtype A Human immunodeficiency virus 1 proviral 
DNA, complete genome, clone:pUG031-Al GenBank Accession AB098330, subtype 
D Human immunodeficiency virus type 1 complete proviral genome, strain 92UG001 

20 GenBank Accession AJ320484, subtype D HIV-1 isolate 94UG1 14 from Uganda 
GenBank Accession U88824, subtype D Human immunodeficiency virus type 1, 
isolate ELIGenBank Accession K03454, and Indian subtype C Human 
immunodeficiency virus type 1 subtype C genomic RNA GenBank Accession 
AB023804. 

25 The various regions of the HIV genome are shown in Table 1 , with numbering 

relative to 8_5_TV1_C.ZA (Figures 1-A-1D). Thus, the term "Pol" refers to one or 
more of the following polypeptides: polymerase (p6Pol); protease (prof); reverse 
transcriptase (p66RT or RT); RNAseH (pl5RNAseH); and/or integrase (p31Int or Int). 
Identification of gene regions for any selected HIV isolate (e.g., strains within a 

30 subtype, or strains derived from different subtypes) can be performed by one of 

ordinary skill in the art based on the teachings presented herein and the information 
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known in the art, for example, by performing nucleotide and/or polypeptide 
alignments relative to 8_5_TV1_C.ZA (polynucleotide sequence presented in Figures 
1 A- ID) or alignment to other known HIV isolates, for example, Subtype B isolates 
with gene regions (e.g., SF2, GenBank Accession number K02007; SF162, GenBank 
5 Accession Number M38428) and Subtype C isolates with gene regions (e.g., GenBank 
Accession Number AF1 10965 and GenBank Accession Number AF1 10975). 

As used herein, the term "virus-like particle" or "VLP" refers to a 
nonreplicating, viral shell, derived from any of several viruses discussed further 
below. VLPs are generally composed of one or more viral proteins, such as, but not 

10 limited to those proteins referred to as capsid, coat, shell, surface and/or envelope 

proteins, or particle-forming polypeptides derived from these proteins. VLPs can form 
spontaneously upon recombinant expression of the protein in an appropriate 
expression system. Methods for producing particular VLPs are known in the art and 
discussed more fully below. The presence of VLPs following recombinant expression 

1 5 of viral proteins can be detected using conventional techniques known in the art, such . 
as by electron microscopy, X-ray crystallography, and the like. See, e.g., Baker et al., 
Biophys. J. (1991) 60:1445-1456; Hagensee et al., J. Virol. (1994) 68:4503-4505. For 
example, VLPs can be isolated by density gradient centrifugation and/or identified by 
characteristic density banding. Alternatively, cryoelectron microscopy can be 

20 performed on vitrified aqueous samples of the VLP preparation in question, and 
images recorded under appropriate exposure conditions. 

By "particle-forming polypeptide" derived from a particular viral protein is 
meant a full-length or near full-length viral protein, as well as a fragment thereof, or a 
viral protein with internal deletions, which has the ability to form VLPs under 

25 conditions that favor VLP formation. Accordingly, the polypeptide may comprise the 
full-length sequence, fragments, truncated and partial sequences, as well as analogs 
and precursor forms of the reference molecule. The term therefore intends deletions, 
additions and substitutions to the sequence, so long as the polypeptide retains the 
ability to form a VLP. Thus, the term includes natural variations of the specified 

30 polypeptide since variations in coat proteins often occur between viral isolates. The 
term also includes deletions, additions and substitutions that do not naturally occur in 
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the reference protein, so long as the protein retains the ability to form a VLP. 
Preferred substitutions are those which are conservative in nature, i.e., those 
substitutions that take place within a family of amino acids that are related in their side 
chains. Specifically, amino acids are generally divided into four families: (1) acidic ~ 
5 aspartate and glutamate; (2) basic - lysine, arginine, histidine; (3) non-polar ~ 
alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan; 
and (4) uncharged polar — glycine, asparagine, glutamine, cystine, serine threonine, 
tyrosine. Phenylalanine, tryptophan, and tyrosine are sometimes classified as aromatic 
amino acids. 

10 The term "HIV polypeptide" refers to any amino acid sequence that exhibits 

sequence homology to native HIV polypeptides {e.g., Gag, Env, Prot, Pol, RT, Int, vif, 
vpr, vpu, tat, rev, nef and/or combinations thereof) and/or which is functional. Non- 
limiting examples of functions that may be exhibited by HIV polypeptides include, use 
as immunogens {e.g., to generate a humoral and/or cellular immune response), use in 

15 diagnostics {e.g. bound by suitable antibodies for use in ELISAs or other 

immunoassays) and/or polypeptides which exhibit one or more biological activities 
associated with the wild type or synthetic HIV polypeptide. For example, as used 
herein, the term "Gag polypeptide" may refer to a polypeptide that is bound by one or 
more anti-Gag antibodies; elicits a humoral and/or cellular immune response; and/or 

20 exhibits the ability to form particles. 

An "antigen" refers to a molecule containing one or more epitopes (either 
linear, conformational or both) that will stimulate a host's immune system to make a 
humoral and/or cellular antigen-specific response. The term is used interchangeably 
with the term "immunogen." Normally, a B-cell epitope will include at least about 5 

25 amino acids but can be as small as 3-4 amino acids. A T-cell epitope, such as a CTL 
epitope, will include at least about 7-9 amino acids, and a helper T-cell epitope at least 
about 12-20 amino acids. Normally, an epitope will include between about 7 and 15 
amino acids, such as, 9, 10, 12 or 15 amino acids. The term "antigen" denotes both 
subunit antigens, (i.e., antigens which are separate and discrete from a whole organism 

30 with which the antigen is associated in nature), as well as, killed, attenuated or 

inactivated bacteria, viruses, fungi, parasites or other microbes. Antibodies such as 
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anti-idiotype antibodies, or fragments thereof, and synthetic peptide mimotopes, which 
can mimic an antigen or antigenic determinant, are also captured under the definition 
of antigen as used herein. Similarly, an oligonucleotide or polynucleotide which 
expresses an antigen or antigenic determinant in vivo, such as in gene therapy and 
5 DNA immunization applications, is also included in the definition of antigen herein. 

For purposes of the present invention, antigens (e.g., polynucleotide encoding 
antigens, or polypeptides comprising antigens) can be derived from any 
microorganism having more than one subtype, serotype, or strain variation (e.g., 
viruses, bacteria, parasites, fungi, etc.). The term also intends any of the various 

10 tumor antigens. Furthermore, for purposes of the present invention, an "antigen" 
refers to a protein which includes modifications, such as deletions, additions and 
substitutions (generally conservative in nature), to the native sequence, so long as the 
protein maintains the ability to elicit an immunological response, as defined herein. 
These modifications may be deliberate, as through site-directed mutagenesis, or may 

1 5 be accidental, such as through mutations of hosts which produce the antigens. 

An "immunological response" to an antigen or composition is the development 
in a subject of a humoral and/or a cellular immune response to an antigen present in 
the composition of interest. For purposes of the present invention, a "humoral 
immune response" refers to an immune response mediated by antibody molecules, 

20 while a "cellular immune response" is one mediated by T-lymphocytes and/or other 
white blood cells. One important aspect of cellular immunity involves an antigen- 
specific response by cytolytic T-cells ("CTL"s). CTLs have specificity for peptide 
antigens that are presented in association with proteins encoded by the major 
histocompatibility complex (MHC) and expressed on the surfaces of cells. CTLs help 

25 induce and promote the destruction of intracellular microbes, or the lysis of cells 
infected with such microbes. Another aspect of cellular immunity involves an 
antigen-specific response by helper T-cells. Helper T-cells act to help stimulate the 
function, and focus the activity of, nonspecific effector cells against cells displaying 
peptide antigens in association with MHC molecules on their surface. A "cellular 

30 immune response" also refers to the production of cytokines, chemokines and other 
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such molecules produced by activated T-cells and/or other white blood cells, including 
those derived from CD4+ and CD8+ T-cells. 

A composition or vaccine that elicits a cellular immune response may serve to 
sensitize a vertebrate subject by the presentation of antigen in association with MHC 
5 molecules at the cell surface. The cell-mediated immune response is directed at, or 
near, cells presenting antigen at their surface. In addition, antigen-specific T- 
lymphocytes can be generated to allow for the future protection of an immunized host. 

The ability of a particular antigen to stimulate a cell-mediated immunological 
response may be determined by a number of assays, such as by lymphoproliferation 

10 (lymphocyte activation) assays, CTL cytotoxic cell assays, or by assaying for T- 
lymphocytes specific for the antigen in a sensitized subject. Such assays are well 
known in the art. See, e.g., Erickson et al, J. Immunol. (1993) 151:4189-4199; Doe et 
al., Eur, J. Immunol. (1994) 24:2369-2376. Recent methods of measuring cell- 
mediated immune response include measurement of intracellular cytokines or cytokine 

15 secretion by T-cell populations, or by measurement of epitope specific T-cells (e.g., by 
the tetramer technique)(reviewed by McMichael, A.J., and O'Callaghan, C.A., Exp. 
Med. 187(9)1367-1371, 1998; Mcheyzer-Williams, M.G., et al, Immunol. Rev. 150:5- 
21, 1996; Lalvani, A., et al, /. Exp. Med. 186:859-865, 1997). 

Thus, an immunological response as used herein may be one that stimulates the 

20 production of antibodies (e.g., neutralizing antibodies that block bacterial toxins and 
pathogens such as viruses entering cells and replicating by binding to toxins and 
pathogens, typically protecting cells from infection and destruction). The antigen of 
interest may also elicit production of CTLs. Hence, an immunological response may 
include one or more of the following effects: the production of antibodies by B-cells; 

25 and/or the activation of suppressor T-cells and/or memory/effector T-cells directed 
specifically to an antigen or antigens present in the composition or vaccine of interest. 
These responses may serve to neutralize infectivity, and/or mediate antibody- 
complement, or antibody dependent cell cytotoxicity (ADCC) to provide protection to 
an immunized host. Such responses can be determined using standard immunoassays 

30 and neutralization assays, well known in the art. (See, e.g., Montefiori et al. (1988) 

15 
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Clin Microbiol. 26:231-235; Dreyer et al. (1999) AIDS Res Hum Retroviruses (1999) 
15(17):1563-1571). 

An "immunogenic HIV polypeptide" is a polypeptide capable of elicting an 
immune response against one or more native HIV polypeptides, when the 
5 immunogenic polypeptide is adminstered to a laboratory test animal (such as a mouse, 
guina pig, rhesus macaque, chimpanzee, baboon, etc.). 

An "immunogenic composition" is a composition that comprises an antigenic 
molecule where administration of the composition to a subject results in the 
development in the subject of a humoral and/or a cellular immune response to the 

10 antigenic molecule of interest. The immunogenic composition can be introduced 
directly into a recipient subject, such as by injection, inhalation, oral, intranasal and 
mucosal (e.g., intra-rectally or intra-vaginally) administration. 

By "subunit vaccine" is meant a vaccine composition which includes one or 
more selected antigens but not all antigens, derived from or homologous to, an antigen 

1 5 from a pathogen of interest such as from a virus, bacterium, parasite or fungus. Such a 
composition is substantially free of intact pathogen cells or pathogenic particles, or the 
lysate of such cells or particles. Thus, a "subunit vaccine" can be prepared from at 
least partially purified (preferably substantially purified) immunogenic polypeptides 
from the pathogen, or analogs thereof. The method of obtaining an antigen included 

20 in the subunit vaccine can thus include standard purification techniques, recombinant 
production, or synthetic production. 

"Substantially purified" general refers to isolation of a substance (compound, 
polynucleotide, protein, polypeptide, polypeptide composition) such that the substance 
comprises the majority percent of the sample in which it resides. Typically in a sample 

25 a substantially purified component comprises 50%, preferably 80%-85%, more 
preferably 90-95% of the sample. Techniques for purifying polynucleotides and 
polypeptides of interest are well-known in the art and include, for example, ion- 
exchange chromatography, affinity chromatography and sedimentation according to 
density. 

30 A "polynucleotide coding sequence" or a polynucleotide sequence that 

"encodes" a selected polypeptide, is a nucleic acid molecule that is transcribed (in the 
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case of DNA) and translated (in the case of mRNA) into a polypeptide in vivo when 
placed under the control of appropriate regulatory sequences (or "control elements"). 
The boundaries of the coding sequence are determined by a start codon, for example, 
at or near the 5' terminus and a translation stop codon, for example, at or near the 3' 
5 terminus. A coding sequence can include, but is not limited to, cDNA from viral, 
procaryotic or eucaryotic mRNA, genomic DNA sequences from viral or procaryotic 
DNA, and even synthetic DNA sequences. Exemplary coding sequences are codon 
optimized viral polypeptide-coding sequences used in the present invention. The 
coding regions of the polynucleotide sequences of the present invention are 

10 identifiable by one of skill in the art and may, for example, be easily identified by 
performing translations of all three frames of the polynucleotide and identifying the 
frame corresponding to the encoded polypeptide, for example, a synthetic nef 
polynucleotide of the present invention encodes a nef-derived polypeptide. A 
transcription termination sequence may be located 3' to the coding sequence. 

15 Typical "control elements", include, but are not limited to, transcription 

regulators, such as promoters, transcription enhancer elements, transcription 
termination signals, and polyadenylation sequences; and translation regulators, such as 
sequences for optimization of initiation of translation, e.g., Shine-Dalgarno (ribosome 
binding site) sequences, internal ribosome entry sites (IRES) such as the ECMV IRES, 

20 Kozak-type sequences (i.e., sequences for the optimization of translation, located, for 
example, 5' to the coding sequence, e.g., GCCACC placed in front (5') of an initiating 
ATG), leader sequences, translation initiation codon (e.g., ATG), and translation 
termination sequences {e.g., TAA or, preferably, TAAA placed after (3') the coding 
sequence). In certain embodiments, one or more translation regulation or initiation 

25 sequences {e.g., the leader sequence) are derived from wild-type translation initiation 
sequences, i.e., sequences that regulate translation of the coding region in their native 
state. Wild-type leader sequences that have been modified, using the methods 
described herein, also find use in the present invention. Promoters can include 
inducible promoters (where expression of a polynucleotide sequence operably linked 

30 to the promoter is induced by an analyte, cofactor, regulatory protein, etc.), repressible 
promoters (where expression of a polynucleotide sequence operably linked to the 
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promoter is induced by an analyte, cofactor, regulatory protein, etc.), and constitutive 
promoters. 

A "nucleic acid" molecule or "polynucleotide" can include, but is not limited 
to, procaryotic sequences, eucaryotic mRNA, cDNA from eucaryotic mRNA, genomic 
5 DNA sequences from eucaryotic (e.g., mammalian) DNA, and even synthetic DNA 
sequences. The term also captures sequences that include any of the known base 
analogs of DNA and RNA. In referring to the polynucleotide of the invention, in 
those examples in which "DNA" is specifically recited, it will be apparent that for 
many such embodiments, RNA is likewise intended. 

10 "Operably linked" refers to an arrangement of elements wherein the 

components so described are configured so as to perform their usual function. Thus, a 
given promoter operably linked to a coding sequence is capable of effecting the 
expression of the coding sequence when the proper enzymes are present. The 
promoter need not be contiguous with the coding sequence, so long as it functions to 

1 5 direct the expression thereof. Thus, for example, intervening untranslated yet 

transcribed sequences can be present between the promoter sequence and the coding 
sequence and the promoter sequence can still be considered "operably linked" to the 
coding sequence. 

"Recombinant" as used herein to describe a nucleic acid molecule means a 
20 polynucleotide of genomic, cDNA, semisynthetic, or synthetic origin which, by virtue 
of its origin or manipulation: (1) is not associated with all or a portion of the 
polynucleotide with which it is associated in nature; and/or (2) is linked to a 
polynucleotide other than that to which it is linked in nature. The term "recombinant" 
as used with respect to a protein or polypeptide means a polypeptide produced by 
25 expression of a recombinant polynucleotide. "Recombinant host cells," "host cells," 
"cells," "cell lines," "cell cultures," and other such terms denoting procaryotic 
microorganisms or eucaryotic cell lines cultured as unicellular entities, are used 
interchangeably, and refer to cells which can be, or have been, used as recipients for 
recombinant vectors or other transfer DNA, and include the progeny of the original 
30 cell which has been transfected. It is understood that the progeny of a single parental 
cell may not necessarily be completely identical in morphology or in genomic or total 
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DNA complement to the original parent, due to accidental or deliberate mutation. 
Progeny of the parental cell which are sufficiently similar to the parent to be 
characterized by the relevant property, such as the presence of a nucleotide sequence 
encoding a desired peptide, are included in the progeny intended by this definition, 
5 and are covered by the above terms. 

Techniques for determining amino acid sequence "similarity" are well known 
in the art. In general, "similarity" means the exact amino acid to amino acid 
comparison of two or more polypeptides at the appropriate place, where amino acids 
are identical or possess similar chemical and/or physical properties such as charge or 

10 hydrophobicity. A so-termed "percent similarity" then can be determined between the 
compared polypeptide sequences. Techniques for determining nucleic acid and amino 
acid sequence identity also are well known in the art and include determining the 
nucleotide sequence of the mRNA for the gene encoding the amino acid sequence 
(usually via a cDNA intermediate) and determining the amino acid sequence encoded 

15 thereby, and comparing this to a second amino acid sequence. In general, "identity" 
refers to an exact amino acid to amino acid or nucleotide to nucleotide 
correspondence of two polypeptide sequences or polynucleotide sequences, 
respectively. 

Two or more polynucleotide sequences can be compared by determining their 
20 "percent identity." Two or more amino acid sequences likewise can be compared by 
determining their "percent identity." The percent identity of two sequences, whether 
nucleic acid or peptide sequences, is generally described as the number of exact 
matches between two aligned sequences divided by the length of the shorter sequence 
and multiplied by 100. An approximate alignment for nucleic acid sequences is 
25 provided by the local homology algorithm of Smith and Waterman, Advances in 

Applied Mathematics 2:482-489 (1981). This algorithm can be extended to use with 
peptide sequences using the scoring matrix developed by Dayhoff, Atlas of Protein 
Sequences and Structure, M.O. Dayhoff ed., 5 suppl. 3:353-358, National Biomedical 
Research Foundation, Washington, D.C., USA, and normalized by Gribskov, Nucl. 
30 Acids Res. 14(6):6745-6763 (1986). An implementation of this algorithm for nucleic 
acid and peptide sequences is provided by the Genetics Computer Group (Madison, 
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WI) in their BestFit utility application. The default parameters for this method are 
described in the Wisconsin Sequence Analysis Package Program Manual, Version 8 
(1995) (available from Genetics Computer Group, Madison, WI). Other equally 
suitable programs for calculating the percent identity or similarity between sequences 
5 are generally known in the art. 

For example, percent identity of a particular nucleotide sequence to a reference 
sequence can be determined using the homology algorithm of Smith and Waterman 
with a default scoring table and a gap penalty of six nucleotide positions. Another 
method of establishing percent identity in the context of the present invention is to use 

1 0 the MPSRCH package of programs copyrighted by the University of Edinburgh, 

developed by John F. Collins and Shane S. Sturrok, and distributed by IntelliGenetics, 
Inc. (Mountain View, CA). From this suite of packages, the Smith- Waterman 
algorithm can be employed where default parameters are used for the scoring table 
(for example, gap open penalty of 12, gap extension penalty of one, and a gap of six). 

15 From the data generated, the "Match" value reflects "sequence identity." Other 

suitable programs for calculating the percent identity or similarity between sequences 
are generally known in the art, such as the alignment program BLAST, which can also 
be used with default parameters. For example, in a preferred embodiment, BLASTN 
and BLASTP can be used with the following default parameters for nucleic acid 

20 searches -- genetic code = standard; filter = none; strand = both; cutoff = 60; expect = 
10; Matrix = BLOSUM62; Descriptions = 50 sequences; sort by = HIGH SCORE; 
Databases = non-redundant, GenBank + EMBL + DDBJ + PDB + GenBank CDS 
translations + Swiss protein + Spupdate + PIR; (ii) polypeptide searches ~ . Details of 
these programs can be found at the following internet address: www.ncbi.nlm.gov/cgi- 

25 bin/BLAST. 

Protein similarity and percent identity sequence searches can be carried out, 
for example, using Smith- Waterman Similarity Search algorithms (e.g., at 
www.ncbi.nlm.gov, or from commercial sources, such as, TimeLogic Corporation, 
Crystal Bay, NV). For example, in a preferred embodiment, the Smith- Waterman 
30 Similarity Search can be used with default parameters, for example, as follows: 
Weight MATRIX = BLOSUM62.MAA; Gap Opening PENALTY = -12; Gap 
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Extension PENALTY = -2; FRAME PENALTY = 0; QUERY FORMAT = 
FASTA/PEARSON; QUERY TYPE = AA; QUERY SEARCH = 1; QUERY SET = 
CGI_ld82ws301bde.seq; TARGET TYPE = AA; TARGET SET = NRPdb gsaa; 
SIGNIFICANCE = GAPPED; MAX SCORES - 30; MAX ALIGNMENTS = 20; 
5 Reporting THRESHOLD = Score= 1 ; ALIGNMENT THRESHOLD = 20. 

One of skill in the art can readily determine the proper search parameters to 
use for a given sequence, exemplary preferred Smith Waterman based parameters are 
presented above. For example, the search parameters may vary based on the size of 
the sequence in question. Thus, for polynucleotide sequences of the present invention 

10 the length of the polynucleotide sequence disclosed herein is searched against a 
selected database and compared to sequences of essentially the same length to 
determine percent identity. For example, a representative embodiment of the present 
invention would include an isolated polynucleotide comprising X contiguous 
nucleotides, wherein (i) the X contiguous nucleotides have at least about a selected 

1 5 level of percent identity relative to Y contiguous nucleotides of one or more of the 
sequences described herein or fragment thereof, and (ii) for search purposes X equals 
Y, wherein Y is a selected reference polynucleotide of defined length (for example, a 
length of from 15 nucleotides up to the number of nucleotides present in a selected 
full-length sequence). 

20 The sequences of the present invention can include fragments of the sequences, 

for example, from about 15 nucleotides up to the number of nucleotides present in the 
full-length sequences described herein, including all integer values falling within the 
above-described range. For example, fragments of the polynucleotide sequences of 
the present invention may be 30-60 nucleotides, 60-120 nucleotides, 120-240 

25 nucleotides, 240-480 nucleotides, 480- 1 000 nucleotides, and all integer values 
therebetween. 

The synthetic polynucleotides described herein include related polynucleotide 
sequences having about 80% to 100%, greater than 80-85%, preferably greater than 
90-92%, more preferably greater than 95%, and most preferably greater than 98% up 
30 to 100% (including all integer values falling within these described ranges) sequence 
identity to the synthetic polynucleotide sequences disclosed herein when the 
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sequences of the present invention are used as the query sequence against, for 
example, a database of sequences. 

Two nucleic acid fragments are considered to "selectively hybridize" as 
described herein. The degree of sequence identity between two nucleic acid molecules 
5 affects the efficiency and strength of hybridization events between such molecules. A 
partially identical nucleic acid sequence will at least partially inhibit a completely 
identical sequence from hybridizing to a target molecule. Inhibition of hybridization 
of the completely identical sequence can be assessed using hybridization assays that 
are well known in the art (e.g., Southern blot, Northern blot, solution hybridization, or 

10 the like, see Sambrook, et al., supra or Ausubel et al., supra). Such assays can be 

conducted using varying degrees of selectivity, for example, using conditions varying 
from low to high stringency. If conditions of low stringency are employed, the 
absence of non-specific binding can be assessed using a secondary probe that lacks 
even a partial degree of sequence identity (for example, a probe having less than about 

1 5 30% sequence identity with the target molecule), such that, in the absence of non- 
specific binding events, the secondary probe will not hybridize to the target. 

When utilizing a hybridization-based detection system, a nucleic acid probe is 
chosen that is complementary to a target nucleic acid sequence, and then by selection 
of appropriate conditions the probe and the target sequence "selectively hybridize," or 

20 bind, to each other to form a hybrid molecule. A nucleic acid molecule that is capable 
of hybridizing selectively to a target sequence under "moderately stringent" typically 
hybridizes under conditions that allow detection of a target nucleic acid sequence of at 
least about 10-14 nucleotides in length having at least approximately 70% sequence 
identity with the sequence of the selected nucleic acid probe. Stringent hybridization 

25 conditions typically allow detection of target nucleic acid sequences of at least about 
10-14 nucleotides in length having a sequence identity of greater than about 90-95% 
with the sequence of the selected nucleic acid probe. Hybridization conditions useful 
for probe/target hybridization where the probe and target have a specific degree of 
sequence identity, can be determined as is known in the art (see, for example, Nucleic 

30 Acid Hybridization: A Practical Approach, editors B.D. Hames and S.J. Higgins, 
(1985) Oxford; Washington, DC; fRL Press). 
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With respect to stringency conditions for hybridization, it is well known in the 
art that numerous equivalent conditions can be employed to establish a particular 
stringency by varying, for example, the following factors: the length and nature of 
probe and target sequences, base composition of the various sequences, concentrations 
5 of salts and other hybridization solution components, the presence or absence of 
blocking agents in the hybridization solutions (e.g., formamide, dextran sulfate, and 
polyethylene glycol), hybridization reaction temperature and time parameters, as well 
as, varying wash conditions. The selection of a particular set of hybridization 
conditions is selected following standard methods in the art (see, for example, 

10 Sambrook, et al., supra or Ausubel et al., supra). 

A first polynucleotide is "derived from" second polynucleotide if the first 
polynucleotide has the same basepair sequence as a region of the second 
polynucleotide, its cDNA, complements thereof, or if the first polynucleotide displays 
subtantial sequence identity to a region of the second polynucleotide, its cDNA, 

15 complements thereof, wherein sequence identity is determined as described above. 
Substantial sequence identity is typically about 90% or greater, preferably about 95% 
or greater, more preferably about 98% or greater. 

A first polypeptide is "derived from" a second polypeptide if it is encoded by a 
first polynucleotide derived from a second polynucleotide, or the first polypeptide has 

20 the same amino acid sequence as the second polypeptide or a portion thereof, or the 
first polypeptide displays substantial sequence identity to the second polypeptide or a 
portion thereof, wherein sequence identity is determined as described above. 
Substantial sequence identity is typically about 90% or greater, preferably about 95% 
or greater, more preferably about 98% or greater. 

25 Generally, a viral polypeptide is "derived from" a particular polypeptide of a 

virus (viral polypeptide) if it is (i) encoded by the same open reading frame of a 
polynucleotide of that virus (viral polynucleotide), or (ii) displays substantial sequence 
identity to a polypeptide of that virus as described above. 

A polypeptide is "derived from" an HIV subtype if it is derived from a 

30 polypeptide present in a member of the subtype, derived from a polypeptide encoded 
by a polynucleotide present in a member of the subtype, encoded by a polynucleotide 
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that is derived from a polynucleotide present in a member of the subtype, or derived 
from a polypeptide encoded by a polynucleotide that is derived from a polynucleotide 
present in a member of the subtype. 

A polypeptide is "derived from" an HIV strain if it is derived from a 
5 polypeptide present in a member of the strain, derived from a polypeptide encoded by 
a polynucleotide present in a member of the strain, encoded by a polynucleotide that is 
derived from a polynucleotide present in a member of the strain, or derived from a 
polypeptide encoded by a polynucleotide that is derived from a polynucleotide present 
in a member of the strain. 

10 "Analogous polypeptides" refers to polypeptides that are encoded by, or 

derived from polypeptides encoded by, the same gene of the same organism but from 
different polynucleotide sources. In the context of the present invention, different 
polynucleotide sources could be different subtypes, different serotypes or different 
strains. Thus, for example, a Gag polypeptide from a Subtype B HIV would be an 

15 analogous polypeptide to a Gag polypeptide from a Subtype C HIV, or an envelope 
polypeptide derived from a first HIV-1 subtype, serotype, or strain would be an 
analogous polypeptide to an envelope polypeptide derived from a second HIV-1 
subtype, serotype, or strain. Examples of types of analogous polypeptides that could 
be derived from different HIV-1 subtypes or strains include, the envelope polypeptides 

20 gp4 1 , gp 1 20, gp 1 40, and gp 1 60, all of which are considered analogous polypeptides. 
Further, such analogous polypeptides may each comprise different alterations or 
mutations, for example, analogous polypeptides derived from the HIV-1 envelope 
gene include, but are not limited to, the following: a gp41 polypeptide, a gpl20 
polypeptide, a gpl40 polypeptide, a gpl60 polypeptide, a gpl40 comprising a deletion 

25 of a portion of the VI loop, a gpl40 polypeptide comprising a deletion of a portion of 
the V2 loop, a gp 140 polypeptide comprising a deletion of a portion of the V3 loop, a 
gpl40 polypeptide with a mutated protease cleavage site, a gpl60 comprising a 
deletion of a portion of the VI loop, a gpl60 polypeptide comprising a deletion of a 
portion of the V2 loop, a gp 160 polypeptide comprising a deletion of a portion of the 

30 V3 loop, and a gpl 60 polypeptide with a mutated protease cleavage site. 
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A "gene" as used in the context of the present invention is a sequence of 
nucleotides in a genetic nucleic acid (viral genome, chromosome, plasmid, etc.) with 
which a genetic function is associated. A gene is a hereditary unit, for example of an 
organism comprising a polynucleotide sequence (e.g., an RNA sequence for HIV-1 or 
5 a proviral HIV-1 DNA sequence), that occupies a specific physical location (a "gene 
locus" or "genetic locus") within the genome of an organism. A gene can encode an 
expressed product, such as a polypeptide or a polynucleotide (e.g., tRNA). 
Alternatively, a gene may define a genomic location for a particular event/function, 
such as the binding of proteins and/or nucleic acids (e.g., 5' LTR), wherein the gene 

10 does not encode an expressed product. Examples of HIV-1 genes include, but are not 
limited to, Gag, Env, Pol (prot, RNase, Int), tat, rev, nef, vif, vpr, and vpu. A gene 
may include coding sequences, such as, polypeptide encoding sequences, and non- 
coding sequences, such as, promoter sequences, poly-adenlyation sequences, 
transcriptional regulatory sequences (e.g., enhancer sequences). Many eucaryotic 

15 genes have "exons" (coding sequences) interrupted by "introns" (non-coding 

sequences). In certain cases, a gene may share sequences with another gene(s) (e.g., 
overlapping genes). It is noted that in the general population, wild-type genes may 
include multiple prevalent versions that contain alterations in sequence relative to each 
other. These variations are designated "polymorphisms" or "allelic variations." 

20 "Purified polynucleotide" refers to a polynucleotide of interest or fragment 

thereof that is essentially free, e.g., contains less than about 50%, preferably less than 
about 70%, and more preferably less than about 90%, of the protein with which the 
polynucleotide is naturally associated. Techniques for purifying polynucleotides of 
interest are well-known in the art and include, for example, disruption of the cell 

25 containing the polynucleotide with a chaotropic agent and separation of the 
polynucleotide(s) and proteins by ion-exchange chromatography, affinity 
chromatography and sedimentation according to density. 

By "nucleic acid immunization" is meant the introduction of a nucleic acid 
molecule encoding one or more selected antigens into a host cell, for the in vivo 

30 expression of an antigen, antigens, an epitope, or epitopes. The nucleic acid molecule 
can be introduced directly into a recipient subject, such as by injection, inhalation, 
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oral, intranasal and mucosal administration, or the like, or can be introduced ex vivo, 
into cells which have been removed from the host. In the latter case, the transformed 
cells are reintroduced into the subject where an immune response can be mounted 
against the antigen encoded by the nucleic acid molecule. 
5 "Gene transfer" or "gene delivery" refers to methods or systems for reliably 

inserting nucleic acid (i.e., DNA or RNA) of interest into a host cell. Such methods 
can result in transient expression of non-integrated transferred DNA, 
extrachromosomal replication and expression of transferred replicons (e.g., episomes), 
or integration of transferred genetic material into the genomic DNA of host cells. 
10 Gene delivery expression vectors include, but are not limited to, vectors derived from 
adenoviruses, adeno-associated viruses, alphaviruses, herpes viruses, measles viruses, 
polio viruses, pox viruses, vesiculoviruses and vaccinia viruses. When used for 
immunization, such gene delivery expression vectors may be referred to as vaccines or 
vaccine vectors. 

1 5 The term "transfection" is used to refer to the uptake of foreign DNA by a cell. 

A cell has been "transfected" when exogenous DNA has been introduced inside the 
cell membrane. A number of transfection techniques are generally known in the art. 
See, e.g., Graham et al. (1973) Virology, 52:456, Sambrook et al. (1989) Molecular 
Cloning, a laboratory manual, Cold Spring Harbor Laboratories, New York, Davis et 

20 al. (1986) Basic Methods in Molecular Biology, Elsevier, and Chu et al. (1981) Gene 
13: 197. Such techniques can be used to introduce one or more exogenous DNA 
moieties into suitable host cells. The term refers to both stable and transient uptake of 
the genetic material, and includes uptake of peptide- or antibody-linked DNAs. 

A "vector" is capable of transferring gene sequences to target cells (e.g., viral 

25 vectors, non-viral vectors, particulate carriers, and liposomes). Typically, "vector 
construct," "expression vector," and "gene transfer vector," mean any nucleic acid 
construct capable of directing the expression of a gene of interest and which can be 
used to transfer gene sequences to target cells. Thus, the term includes cloning and 
expression vehicles, as well as viral vectors. 

30 "Lentiviral vector", and "recombinant lentiviral vector" refer to a nucleic acid 

construct which carries, and within certain embodiments, is capable of directing the 



26 



PP21 100.001 
PATENT 



expression of a nucleic acid molecule of interest. The lentiviral vector include at least 
one transcriptional promoter/enhancer or locus defining element(s), or other elements 
which control gene expression by other means such as alternate splicing, nuclear RNA 
export, post-translational modification of messenger, or post-transcriptional 
5 modification of protein. Such vector constructs must also include a packaging signal, 
long terminal repeats (LTRS) or portion thereof, and positive and negative strand 
primer binding sites appropriate to the retrovirus used (if these are not already present 
in the retroviral vector). Optionally, the recombinant lentiviral vector may also 
include a signal which directs polyadenylation, selectable markers such as Neo, TK, 

10 hygromycin, phleomycin, histidinol, or DHFR, as well as one or more restriction sites 
and a translation termination sequence. By way of example, such vectors typically 
include a 5' LTR, a tRNA binding site, a packaging signal, an origin of second strand 
DNA synthesis, and a 3 'LTR or a portion thereof 

"Lentiviral vector particle" as utilized within the present invention refers to a 

15 lentivirus which carries at least one gene of interest. The retrovirus may also contain a 
selectable marker. The recombinant lentivirus is capable of reverse transcribing its 
genetic material (RNA) into DNA and incorporating this genetic material into a host 
cell's DNA upon infection. Lentiviral vector particles may have a lentiviral envelope, 
a non-lentiviral envelope (e.g., an ampho or VSV-G envelope), or a chimeric 

20 envelope. 

"Alphaviral vector", and "recombinant alphaviral vector" and "alphaviral 
replicon vector" refer to a nucleic acid construct which carries, and within certain 
embodiments, is capable of directing the expression of a nucleic acid molecule of 
interest. The alphaviral vector includes at least one transcriptional promoter/enhancer 

25 or other elements which control gene expression by other means such as alternate 
splicing, nuclear RNA export, post-translational modification of messenger, or post- 
transcriptional modification of protein. Such vector constructs must also include a 
packaging signal, and alphaviral replication recognition sequences. Optionally, the 
recombinant alphaviral vector may also include a signal which directs 

30 polyadenylation, selectable markers such as Neo, TK, hygromycin, phleomycin, 
histidinol, or DHFR, as well as one or more restriction sites and a translation 
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termination sequence. Typically, the alphaviral vector will include coding sequences 
for the alphaviral non-structural proteins, a packaging site, replication recognition 
sequences and a sequence capable of directing the expression of the nucleic acid 
molecule of interest. 

5 "Expression cassette" refers to an assembly which is capable of directing the 

expression of a sequence or gene of interest. An expression cassette typically includes 
a promoter which is operably linked to the polynucleotide sequences or gene(s) of 
interest. Other control elements may be present as well. Expression cassettes 
described herein may be contained within a plasmid construct. In addition to the 

10 components of the expression cassette, the plasmid construct may also include a 

bacterial origin of replication, one or more selectable markers, a signal which allows 
the plasmid construct to exist as single-stranded DNA (e.g., a Ml 3 origin of 
replication), a multiple cloning site, and a "mammalian" origin of replication (e.g., a 
SV40 or adenovirus origin of replication). 

1 5 "Packaging cell" refers to a cell that comprises those elements necessary for 

production of infectious recombinant viral vector, but which lack the recombinant 
viral vector. Typically, such packaging cells contain one or more expression cassettes 
that are capable of expressing proteins necessary for the replication and packaging of 
an introduced vector, for example, in the case of a lentiviral vector expression 

20 cassettes which encode Gag, pol and env proteins, in the case of an alphaviral vector, 
expression cassettes that encode alphaviral structural proteins. 

"Producer cell" or "vector producing cell" refers to a cell which contains all 
elements necessary for production of recombinant viral vector particles. 

Transfer of a "suicide gene" (e.g., a drug-susceptibility gene) to a target cell 

25 renders the cell sensitive to compounds or compositions that are relatively nontoxic to 
normal cells. Moolten, F.L. (1994) Cancer Gene Ther. 1:279-287. Examples of 
suicide genes are thymidine kinase of herpes simplex virus (HSV-tk), cytochrome 
P450 (Manome et al. (1996) Gene Therapy 3:513-520), human deoxycytidine kinase 
(Manome et al. (1996) Nature Medicine 2(5):567-573) and the bacterial enzyme 

30 cytosine deaminase (Dong et al. (1996) Human Gene Therapy 7:713-720). Cells 
which express these genes are rendered sensitive to the effects of the relatively 
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nontoxic prodrugs ganciclovir (HSV-tk), cyclophosphamide (cytochrome P450 2B1), 
cytosine arabinoside (human deoxycytidine kinase) or 5-fluorocytosine (bacterial 
cytosine deaminase). Culver et al. (1992) Science 256:1550-1552, Huber et al. (1994) 
Proc. Natl. Acad. Sci. USA 91:8302-8306. 
5 A "selectable marker" or "reporter marker" refers to a nucleotide sequence 

included in a gene transfer vector that has no therapeutic activity, but rather is 
included to allow for simpler preparation, manufacturing, characterization or testing of 
the gene transfer vector. 

A "specific binding agent" refers to a member of a specific binding pair of 

10 molecules wherein one of the molecules specifically binds to the second molecule 
through chemical and/or physical means. One example of a specific binding agent is 
an antibody directed against a selected antigen. 

By "subject" is meant any member of the subphylum chordata, including, 
without limitation, humans and other primates, including non-human primates such as 

15 baboons, rhesus macaque, chimpanzees and other apes and monkey species; farm 
animals such as cattle, sheep, pigs, goats and horses; domestic mammals such as dogs 
and cats; laboratory animals including rodents such as mice, rats, rabbits, and guinea 
pigs; birds, including domestic, wild and game birds such as chickens, turkeys and 
other gallinaceous birds, ducks, geese, and the like. The term does not denote a 

20 particular age. Thus, both adult and newborn individuals are intended to be covered. 
The system described above is intended for use in any of the above vertebrate species, 
since the immune systems of all of these vertebrates operate similarly. 

By "subtype" is meant a phylogenetic classification of similar organisms into 
groups based on similarities at the genetic (i.e., nucleic acid sequence) level. Such 

25 groups are designated "subtypes." In the HIV field, a well known and widely 
accepted centralized organization for the determination of such similarities and 
classification of particular viral isolates into subtypes is the Los Alamos National 
Laboratory. The HIV subtypes referred to herein are those as determined by the Los 
Alamos National Laboratory. (See, e.g., Myers, et al., Los Alamos Database, Los 

30 Alamos National Laboratory, Los Alamos, New Mexico; Myers, et al., Human 
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Retroviruses and Aids, 1990, Los Alamos, New Mexico: Los Alamos National 
Laboratory.) A subtype can also be referred to as a "clade." 

By "serotype" is meant a classification of similar organisms based on antibody 
cross-reactivity. 

5 By "strain" is intended an organism from within the subtype but which is 

differentiated from other members of the same subtype based on differences in nucleic 
acid sequence. 

By "pharmaceutically acceptable" or "pharmacologically acceptable" is meant 
a material which is not biologically or otherwise undesirable, i.e., the material may be 
10 administered to an individual in a formulation or composition without causing any 
undesirable biological effects or interacting in a deleterious manner with any of the 
components of the composition in which it is contained. 

By "physiological pH" or a "pH in the physiological range" is meant a pH in 
the range of approximately 7.0 to 8.0 inclusive, more typically in the range of 
15 approximately 7.2 to 7.6 inclusive. 

As used herein, "treatment" refers to any of (i) the prevention of infection or 
reinfection, as in a traditional vaccine, (ii) the reduction or elimination of symptoms, 
or (iii) the substantial or complete elimination of the pathogen in question. Treatment 
may be effected prophylactically (prior to infection) or therapeutically (following 
20 infection). 

By "co-administration" is meant administration of more than one composition, 
component of a composition, or molecule. Thus, co-administration includes 
concurrent administration or sequentially administration (in any order), via the same 
or different routes of administration. Non-limiting examples of co-administration 

25 regimes include, co-administration of nucleic acid and polypeptide; co-administration 
of different nucleic acids {e.g., different expression cassettes as described herein 
and/or different gene delivery vectors); and co-administration of different polypeptides 
{e.g., different HIV polypeptides and/or different adjuvants). The term also 
encompasses multiple administrations of one of the co-administered molecules or 

30 compositions {e.g., multiple administrations of one or more of the expression cassettes 
described herein followed by one or more administrations of a polypeptide-containing 
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composition). In cases where the molecules or compositions are delivered 
sequentially, the time between each administration can be readily determined by one 
of skill in the art in view of the teachings herein. 

"T lymphocytes" or "T cells" are non-antibody producing lymphocytes that 
5 constitute a part of the cell-mediated arm of the immune system. T cells arise from 
immature lymphocytes that migrate from the bone marrow to the thymus, where they 
undergo a maturation process under the direction of thymic hormones. Here, the 
mature lymphocytes rapidly divide increasing to very large numbers. The maturing T 
cells become immunocompetent based on their ability to recognize and bind a specific 
10 antigen. Activation of immunocompetent T cells is triggered when an antigen binds to 
the lymphocyte's surface receptors. 

2.0.0 Modes of Carrying Out the Invention 

Before describing the present invention in detail, it is to be understood that this 
15 invention is not limited to particular formulations or process parameters as such may, 
of course, vary. It is also to be understood that the terminology used herein is for the 
purpose of describing particular embodiments of the invention only, and is not 
intended to be limiting. 

Although a number of methods and materials similar or equivalent to those 
20 described herein can be used in the practice of the present invention, the preferred 
materials and methods are described herein. 

2.1.0 General Overview of the Invention 

The present invention relates to combination approaches to generate immune 
25 responses in subjects using compositions comprising immunogenic polynucleotides 
and polypeptides. 

In one general aspect of the present invention, a polynucleotide component of 
the present invention consists essentially of one polynucleotide encoding a 
immunogenic polypeptide derived from a microorganism (e.g., virus, bacteria, fungi, 
30 etc.), and a polypeptide component that comprises one or more immunogenic 

polypeptides analogous to the polypeptide encoded by said polynucleotide component, 
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with the proviso that at least one immunogenic polypeptide of the polypeptide 
component is derived from a different subtype, serotype, or strain of the 
microorganism than the coding sequence of the immunogenic polypeptide encoded by 
the polynucleotide component. In this context, the polynucleotide component 
5 consisting essentially of one polynucleotide encoding an immunogenic polypeptide 
refers to the presence of one polynucleotide encoding one immunogenic polypeptide 
in the composition. The polynucleotide composition may comprise further 
components, such as immune enhancers, immunoregulatory components, vector 
sequences (e.g., viral or non-viral), carriers, particles, excipients, expression control 

10 sequences, etc. In addition, the polynucleotide component may include further 
components such as molecules to enhance the immune response (e.g., liposomes, 
PLG, particles, alum, etc.). Further, the polypeptide component may comprise further 
components, such as, immune enhancers, immunoregulatory components, adjuvants, 
carriers, particles, excipients, etc. In a further embodiment of this composition, the 

1 5 polynucleotide component does not encode an analogous HIV immunogenic 
polypeptide derived from any subtype other than the first subtype, and the 
polypeptide component does not comprise an analogous HIV immunogenic 
polypeptide derived from any subtype other than the first subtype. 

20 In a second general aspect of the present invention, a polynucleotide 

component comprises two or more polynucleotide sequences comprising coding 
sequences for two or more analogous immunogenic polypeptides derived from a 
microorganism (e.g., virus, bacteria, fungi, etc.), wherein the coding sequences for at 
least two of the immunogenic polypeptides are derived from different subtypes, 

25 serotypes, or strains of the microorganism, and the polypeptide component comprises 
one or more immunogenic polypeptides analogous to the polypeptide encoded by said 
polynucleotide component, with the proviso that (i) if the polypeptide component 
provides less than the number of analogous immunogenic polypeptides encoded by the 
polynucleotide component, then the immunogenic polypeptides of the polypeptide 

30 composition may be derived from the same and/or different subtypes, serotypes, or 
strains, as the immunogenic polypeptides provided by the polynucleotide component, 
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or (ii) if the polypeptide component provides the same or greater than the number of 
analogous immunogenic polypeptides encoded by the polynucleotide component, then 
the immunogenic polypeptides of the polypeptide composition are derived from at 
least one different subtype, serotype, or strain than the immunogenic polypeptides 
5 provided by the polynucleotide component. The polynucleotide composition may 
comprise further components, such as immune enhancers, immunoregulatory 
components, vector sequences (e.g., viral or non-viral), carriers, particles, excipients, 
expression control sequences, etc. In addition, the polynucleotide component may 
include further components such as molecules to enhance the immune response (e.g., 

10 liposomes, PLG, particles, alum, etc.). Further, the polypeptide component may 
comprise further components, such as, immune enhancers, immunoregulatory 
components, adjuvants, carriers, particles, excipients, etc. 

The invention is exemplified herein with reference to Human 
Immunodeficiency Virus 1 (HTV-1). One of ordinary skill in the art, in view of the 

15 teachings of the present specification, can apply the teachings of the present invention 
to other suitable organisms, for example, microorganisms. The compositions and 
methods of the present invention may, for example, employ polynucleotides encoding 
HIV envelope polypeptides and well as HIV envelope polypeptides, e.g., HIV 
envelope proteins analogous to those encoded by the polynucleotides, to induce broad 

20 and/or potent neutralizing activity against diverse HIV strains. Although described 
with reference to the HIV virus, the compositions and methods of the present 
invention can be applied to other virus families having a variety of subtypes, 
serotypes, and/or strain variations, for example, including but not limited to other non- 
HIV retroviruses (e.g. HTLV-1, 2), hepadnoviruses (e.g. HBV), herpesviruses (e.g. 

25 HSV-1, 2, CMV, EBV, varizella-zoster, etc.), flaviviruses (e.g. HCV, Yellow fever, 
Tick borne encephalitis, St. Louis Encephalitis, West Nile Virus, etc.), coronaviruses 
(e.g. SARS), paramyxoviruses (e.g., PIV, RSV, measles etc.), influenza viruses, 
picornaviruses, reo viruses (e.g., rotavirus), arenaviruses, rhabdoviruses, 
papovaviruses, parvoviruses, adenoviruses, Dengue virus, bunyaviruses (e.g. , 

30 hantavirus), calciviruses (e.g. Norwalk virus), filoviruses (e.g. , Ebola, Marburg). 



33 



PP21 100.001 
PATENT 



The diversity and mutability of the HIV virus present challenges to HIV 
vaccine development. HIV continues to spread globally, with upwards of 42 million 
people infected with HIV (UNAIDS Report on the global HIV/ AIDS epidemic, 
UNAIDS, Geneva, Switzerland (December 2002). These people are infected with 
5 different HIV subtypes (and/or strains). The infecting HIV subtype (and/or strain) is 
typically geographically dependent. In one aspect, the present invention relates to 
compositions and methods that provide the ability to induce broad and potent 
neutralizing antibodies against the diverse HIV subtypes, serotypes, and/or strains for 
the treatment of infections, reduction of infection risk, reduction of transmission, 
10 reduction of disease manifestations, and/or prevention of HIV infections arising in 
different regions. 

Experiments performed in support of the present invention confirm the use of 
the combination approaches described herein to induce potent and broad HIV- 
neutralization activity. The approaches include immunization with a variety of 
1 5 polynucleotides encoding HIV polypeptides derived from different subtypes, 

serotypes, or strains combined with immunization using HIV polypeptides derived 
from different subtypes, serotypes, or strains. The invention further includes 
immunization using various doses and immunization regimens of such polynucleotides 
and polypeptides. 

20 Accordingly, in a first particular aspect of the present invention, the 

polynucleotide component of the present invention consists essentially of one 
polynucleotide encoding an HIV immunogenic polypeptide, and the polypeptide 
component comprises of one or more HIV immunogenic polypeptides analogous to 
the polypeptide encoded by said polynucleotide component, with the proviso that at 

25 least one HIV immunogenic polypeptide of the polypeptide component is derived 
from a different HIV subtype, serotype, or strain than the coding sequence of the 
immunogenic polypeptide encoded by the polynucleotide component. In this context, 
consists essentially of refers to the presence of one polynucleotide sequence encoding 
one HIV immunogenic polypeptide in the polynucleotide composition. The 

30 polynucleotide composition may comprise further components, such as immune 

enhancers, immunoregulatory components, vector sequences (e.g., viral or non- viral), 
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carriers, particles, excipients, expression control sequences, etc. In one embodiment 
of the present invention, the HIV immunogenic polypeptide encoded by the 
polynucleotide component is derived from subtype B, and at least one coding 
sequence of an HIV immunogenic polypeptide of the polypeptide component is 
5 derived from subtype C. In another embodiment, the HIV immunogenic polypeptide 
encoded by the polynucleotide component is derived from a first strain of a first 
subtype (e.g., a first subtype B strain), and at least one coding sequence of an HIV 
immunogenic polypeptide of the polypeptide component is derived from a second 
strain of the first subtype (e.g., a second subtype B strain). 

10 In one embodiment, a polynucleotide and a polypeptide from different HIV 

subtypes, serotypes, or strains are used for priming and boosting, i.e., a polynucleotide 
encoding an immunogenic HIV polypeptide is used for immunization via delivery of 
the polynucleotide (e.g., a prime), an analogous immunogenic HIV polypeptide 
derived from a different HIV subtype, serotype, or strain is used for immunization 

1 5 (e.g., a boost). For example, a polynucleotide molecule is used for nucleic acid 

immunization, wherein the polynucleotide molecule encodes an HIV gpl40 envelope 
polypeptide (i) derived from a South African HIV subtype C isolate/strain, (ii) that is 
codon optimized for expression in mammalian cells, and (iii) is mutated by deletion of 
the V2 loop (e.g., gpl40mod.TVl.delV2, as described for example in PCT 

20 International Publication No. WO/02/04493). This nucleic acid immunization is 
followed by a protein boost using an HIV gpl40 envelope polypeptide (i) derived 
from a North American HIV subtype B isolate/strain, and (ii) is mutated by deletion of 
the V2 loop (e.g., the protein product of gpl40.mut7.modSF162.delV2, as described 
for example in PCT International Publication No. WO/00/39302). Oligomeric forms 

25 of the envelope polypeptide may be used (e.g., o-gpl40 as described in PCT 

International Publication No. WO/00/39302 and US Patent No. 6,602,705). One 
embodiment of this aspect of the present invention, comprises a composition for 
generating an immune response in a mammal, the composition comprising: a 
polynucleotide component, comprising, a first polynucleotide encoding a first HIV 

30 immunogenic polypeptide; and a polypeptide component, comprising a second HIV 
immunogenic polypeptide,, wherein said first and second immunogenic HIV 
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polypeptide are derived from different HIV subtypes, serotypes, or strains, and (ii) 
said first and second immunogenic polypeptides encode analogous HIV polypeptides. 
In one embodiment of the present invention, the analogous HIV immunogenic 
polypeptides coding sequences that comprise the polynucleotide composition and the 
5 HIV immunogenic polypeptides that comprise the polypeptide component of the 
present invention may be derived from different subtypes of HIV, in another 
embodiment they may derived from different strains of HIV from the same HIV 
subtype. In another embodiment of this aspect of the present invention the 
polynucleotide and polypeptide components of the present invention are used to 

10 broadly raise neutralizing antibodies against viral strains that use the CCR5 coreceptor 
for cell entry. For example, a composition for generating neutralizing antibodies in a 
mammal may comprise, a polynucleotide component consisting essentially of one 
polynucleotide encoding an HIV immunogenic polypeptide derived from an HIV 
strain that uses the CCR5 coreceptor for cell entry, and a polypeptide component 

1 5 comprising one or more HIV immunogenic polypeptides derived from an HIV strain 
that uses the CCR5 coreceptor for cell entry analogous to the polypeptide encoded by 
said polynucleotide component, with the proviso that (i) if the polypeptide component 
has only one HIV immunogenic polypeptide, then the coding sequence of the HIV 
immunogenic polypeptide of the polypeptide component is derived from a different 

20 HIV strain that uses the CCR5 coreceptor for cell entry than the coding sequence of 
the immunogenic polypeptide encoded by the polynucleotide component, or (ii) if the 
polypeptide component comprises greater than one HIV immunogenic polypeptide, 
then the coding sequences of the polypeptides of the polypeptide component are 
derived from more than one HIV strain that uses the CCR5 coreceptor for cell entry. 

25 In second particular aspect of the present invention, the polynucleotide 

component comprises two or more polynucleotide sequences comprising coding 
sequences for two or more analogous HIV immunogenic polypeptides, wherein the 
coding sequences for at least two of the HIV immunogenic polypeptides are derived 
from different HIV subtypes, serotypes, or strains, and the polypeptide component 

30 comprises one or more HIV immunogenic polypeptides analogous to the polypeptide 
encoded by said polynucleotide component, with the proviso that (i) if the polypeptide 
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component provides less than the number of analogous HIV immunogenic 
polypeptides encoded by the polynucleotide component, then the HIV immunogenic 
polypeptides of the polypeptide composition may be derived from the same and/or 
different HIV subtypes, serotypes, or strains as the HIV immunogenic polypeptides 
5 provided by the polynucleotide component, or (ii) if the polypeptide component 
provides the same or greater than the number of analogous HIV immunogenic 
polypeptides encoded by the polynucleotide component, then at least one of the HIV 
immunogenic polypeptides of the polypeptide composition is derived from a different 
HIV subtype, serotype, or strain than the HIV immunogenic polypeptides provided by 

10 the polynucleotide component. 

In one embodiment of the present invention, two or more polynucleotides 
encoding immunogenic HIV polypeptides, derived from at least two different 
subtypes, serotypes, or strains are mixed (e.g., in equal amounts) for priming. Then a 
single, analogous, immunogenic HIV polypeptide derived from one of the subtypes, 

15 serotypes, or strains used for priming is used for boosting. A more general 
embodiment comprises a composition for generating an immune response in a 
mammal, said composition comprising: a polynucleotide component, comprising, two 
or more polynucleotides each encoding analogous HIV immunogenic polypeptides, 
with the proviso that the coding sequences of each HIV immunogenic polypeptide are 

20 derived from different HIV subtypes, serotypes, or strains; and a polypeptide 

component, comprising one or more HIV immunogenic polypeptides, with the proviso 
that said polypeptide component comprises at least one less HIV immunogenic 
polypeptide than encoded by said polynucleotide component. For example, two DNA 
molecules are used for nucleic acid immunization, wherein the first DNA molecule 

25 encodes an HIV gpl40 envelope polypeptide (i) derived from a South African HIV 
subtype C isolate/strain, (ii) that is codon optimized for expression in mammalian 
cells, and (iii) is mutated by deletion of the V2 loop (e.g., gpl40mod.TVl.delV2, as 
described for example in PCT International Publication No. WO/02/04493), and the 
second DNA molecule encodes an HIV gpl40 envelope polypeptide (i) derived from a 

30 North American HIV subtype B isolate, (ii) that is codon optimized for expression in 
mammalian cells, and (iii) is mutated by deletion of the V2 loop (e.g., 
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gpl40.modSF162.delV2, as described for example in PCT International Publication 
No. WO/00/39302). This DNA immunization is followed by a protein boost using a 
single HIV gpl40 envelope polypeptide (i) derived from a North American HIV 
subtype B isolate, and (ii) is mutated by deletion of the V2 loop (e.g., the protein 
5 product of gpl40.mut7.modSF162.delV2, as described for example in PCT 

International Publication No. WO/00/39302). Oligomeric forms of the envelope 
polypeptide may be used (e.g., o-gpl40 as described in PCT International Publication 
No. WO/00/39302). One embodiment of a composition for generating an immune 
response in a mammal comprises, a polynucleotide component comprising a first 

10 polynucleotide encoding a first immunogenic HIV polypeptide, and a second 

polynucleotide encoding a second immunogenic HIV polypeptide, wherein (i) said 
first and second immunogenic HIV polypeptide are derived from different HIV 
subtypes, serotypes, or strains, and (ii) said first and second immunogenic 
polypeptides encode analogous HIV polypeptides, and a polypeptide component 

15 comprising said first HIV immunogenic polypeptide, or said second HIV 
immunogenic polypeptide, with the proviso that said polypeptide component 
comprises at least one less HIV immunogenic polypeptide than is encoded by said 
polynucleotide component. In a preferred embodiment, polynucleotides encoding 
analogous immunogenic HIV polypeptides, derived from a variety of different HIV 

20 subtypes, serotypes, or strains are used for a prime immunization, and a single 
analogous immunogenic HIV polypeptide is used for one or more protein boost. 

In another embodiment, two or more polynucleotides encoding immunogenic 
HIV polypeptides, derived from at least two different subtypes, serotypes, or strains 
are mixed (e.g., in equal amounts) for priming. Then one or more analogous, 

25 immunogenic HIV polypeptides derived from at least two different subtypes, 

serotypes, or strains are used for boosting, wherein at least one of the immunogenic 
HP/ polypeptides is derived from a subtype, serotype, or strain not represented in the 
polynucleotide component. For example, the polynucleotide component comprises 
three polynucleotides encoding three immunogenic HIV polypeptides, one coding 

30 sequence derived from a subtype B strain, one coding sequence derived from a 

subtype C strain, and one coding sequence derived from a subtype E strain, and the 



38 



PP2 1100.001 
PATENT 

polypeptide component comprises three immunogenic HIV polypeptides, one coding 
sequence derived from a subtype B strain, one coding sequence derived from a 
subtype C strain, and one coding sequence derived from a subtype O strain. In another 
embodiment of this aspect of the present invention, the polynucleotides of the 
5 polynucleotide component comprises polynucleotides encoding analogous HIV 
immunogenic polypeptides from different subtypes, serotypes, or strains as the 
polypeptides of the polypeptide component. For example, DNA immunization with 
two or more DNA molecules encoding HIV gpl40 polypeptides (wherein the two or 
more gpl40 coding sequences are derived from two or more HIV-1 subtypes, 

10 serotypes, or strains). The polypeptide component, use for protein immunization, 
comprises two or more gpl40 polypeptides (wherein the two or more gpl40 coding 
sequences are derived from two or more HIV-1 subtypes, serotypes, or strains, with 
the proviso that at least one of the polypeptide sequences is derived from an HIV-1 
subtype, serotype, or strain not represented in the DNA component). 

15 In another embodiment, the polynucleotide component comprises two or more 

polynucleotide sequences comprising coding sequences for two or more analogous 
HIV immunogenic polypeptides, wherein the coding sequences for at least two of the 
HIV immunogenic polypeptides are derived from different HIV strains that use the 
CCR5 coreceptor for cell entry, and the polypeptide component comprises one or 

20 more HIV immunogenic polypeptides analogous to the polypeptide encoded by said 
polynucleotide component, with the proviso that (i) if the polypeptide component 
provides less than the number of analogous HIV immunogenic polypeptides encoded 
by the polynucleotide component, then the HIV immunogenic polypeptides of the 
polypeptide composition may be derived from the same and/or different HIV strains 

25 that use the CCR5 coreceptor for cell entry as the HIV immunogenic polypeptides 
provided by the polynucleotide component, or (ii) if the polypeptide component 
provides the same or greater than the number of analogous HIV immunogenic 
polypeptides encoded by the polynucleotide component, then at least one of the HIV 
immunogenic polypeptides of the polypeptide composition is derived from a different 

30 HIV strain that uses the CCR5 coreceptor for cell entry than the HIV immunogenic 
polypeptides provided by the polynucleotide component. 



39 



PP21 100.001 
PATENT 



In a further aspect, the present invention relates to the use of varied doses of 
polynucleotides and polypeptides in prime/boost methods, particularly the methods 
described herein. In any immunization method using, for example, a mixed 
polynucleotide prime (i.e., two or more polynucleotides encoding immunogenic HIV 
5 polypeptides derived from two or more HIV subtypes, serotypes, or strains) in 
conjunction with a polypeptide boost the present invention includes using reduced 
doses of each single component to provide an equivalent immune response to using 
full doses of each component. In one embodiment, the high threshold of DNA is the 
maximum tolerable dose of DNA (e.g., about 5 mg to about 10 mg total DNA), the 

10 low threshold of DNA is the minimum effective dose (e.g., about 2 ug to about 10 ug 
total DNA), the high threshold of protein is the maximum tolerable dose of protein 
(e.g., about 1 mg total protein), the low threshold of protein is the minimum effective 
dose (e.g., about 2 ug total protein). Experiments performed in support of the present 
invention demonstrated that the total DNA dose may be divided among the 

15 polynucleotides of the polynucleotide component (for example, four polynucleotide 
constructs used, the total DNA for all four is less than or equal to the high threshold) 
(e.g., Example 4). Further, the total polypeptide dose may be divided among the 
polypeptides comprising the polypeptide component (for example, four polypeptides 
used, the total protein for all four is less than or equal to the high threshold) (e.g., 

20 Example 4). The total DNA and total protein are both typically above the low 
threshold values. 

In a preferred embodiment, the total amount of DNA in a given DNA 
immunization has a high threshold of less than or equal to about 10 mg total DNA and 
greater than or equal to 1 mg total DNA, and the total amount of protein in a given 

25 polypeptide boost has a high threshold of less than or equal to about 200 ug total 

protein product and greater than or equal to 10 ug of total protein. For example, in an 
embodiment using a polynucleotide component having two DNA molecules each 
encoding an immunogenic HIV polypeptide the dose of each DNA molecule per 
subject may be one milligram of each DNA molecule encoding an immunogenic HIV 

30 polypeptide, for a total of 2 mg for the two DNA molecules, or 0.5 mg of each DNA 
molecule encoding an immunogenic HIV polypeptide, for a total of 1 mg for the two 
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DNA molecules. Dosing with the polypeptide component may be similarly varied, for 
example, using a polypeptide component having two immunogenic HIV polypeptides 
the dose of each polypeptide per subject may be 100 micrograms of each 
immunogenic HIV polypeptide, for a total of 200 ug for the two polypeptides, 50 
5 micrograms of each immunogenic HIV polypeptide, for a total of 100 ug for the two 
polypeptides, or 25 ug of each immunogenic HIV polypeptide, for a total of 50 ug for 
the two polypeptides. As described above, more than two polypeptides may be 
included in the polypeptide component of the present invention. 

In one embodiment of this aspect of the present invention, the polynucleotides 

10 of the polynucleotide component encode analogous HIV immunogenic polypeptides 
from the same subtypes, serotypes, or strains as the polypeptides of the polypeptide 
component. For example, two DNA molecules are used for nucleic acid 
immunization, wherein the first DNA molecule encodes an HIV gpl40 envelope 
polypeptide (i) derived from a South African HIV subtype C isolate, (ii) that is codon 

1 5 optimized for expression in mammalian cells, (iii) is mutated by deletion of the V2 
loop (e.g., gpl40mod.TVl.delV2, as described for example in PCT International 
Publication No. WO/02/04493), and (iv) is delivered at 0.5 mg, and the second DNA 
molecule encodes an HIV gpl40 envelope polypeptide (i) derived from a North 
American HIV subtype B isolate, (ii) that is codon optimized for expression in 

20 mammalian cells, (iii) is mutated by deletion of the V2 loop (e.g., 

gpl40.modSF162.delV2, as described for example in PCT International Publication 
No. WO/00/39302), and (iv) is delivered at 0.5 mg. This DNA immunization is 
followed by a protein boost using an HIV gpl40 envelope polypeptide (i) derived 
from a South African HIV subtype C isolate, (ii) is mutated by deletion of the V2 loop 

25 (e.g., the protein product of gp 140mod.TVl .mut7.delV2, as described for example in 
PCT International Publication No. WO/02/04493), and (iii) is delivered at 50 ug 
protein, and an HIV gpl40 envelope polypeptide (i) derived from a North American 
HIV subtype B isolate, (ii) is mutated by deletion of the V2 loop (e.g., the protein 
product of gpl40.mut7.modSF162.delV2, as described for example in PCT 

30 International Publication No. WO/00/39302), and (iii) is delivered at 50 ug protein. 



41 



PP21 100.001 
PATENT 



Further, oligomeric forms of the envelope polypeptide may be used (e.g., o-gpl40 as 
described in PCT International Publication No. WO/00/39302). 

In further embodiments, the polynucleotide component of the present invention 
may comprise one or more gene delivery vectors comprising the polynucleotide(s) 
5 encoding immunogenic HIV polypeptide(s). Further components that may be 
included in the polynucleotide component are described herein. The polypeptide 
component of the present invention may comprise an adjuvant in addition to the 
immunogenic polypeptide(s). Further components that may be included in the 
polypeptide component are described herein. 

10 The present invention also comprises methods for generating an immune 

response in a subject. In one general aspect, the method comprises administering to a 
subject a first component providing an immunogenic polypeptide and administering to 
a subject a second component providing a different but analogous immunogenic 
polypeptide. The first component and the second component may be polynucleotide 

1 5 components or polypeptide components. The immunogenic polypeptides may be 
provided either directly (as in a polypeptide component) or indirectly (as in a 
polynucleotide component). In a preferred embodiment, one of the components 
(either first or second component) is a polynucleotide component, and the other 
component (either second or first component) is a polypeptide component. Preferably, 

20 the polypeptide immunogens provided by the first and second components are 

analogous HIV immunogenic polypeptides. The first and second components may be 
administered simultaneously or may be administered at separate times. Preferably, the 
first and second components are administered in a prime-boost regimen. Various 
prime-boost regimens have been described in the art and are well known to those of 

25 ordinary skill. In a typical prime-boost regimen, a first component providing a 

polypeptide immunogen is administered to a subject; the initial immune response is 
followed by determining the production of binding antibodies to the polypeptide 
immunogen in said subject until the titer of binding antibodies begins to decline; and a 
second component providing a different but related polypeptide immunogen is 

30 administered to the subject. 
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The first and second components may be provided as a composition. In a 
particular aspect the method comprises, providing a composition of the present 
invention for generating an immune response in a mammal, administering one or more 
gene delivery vectors comprising the polynucleotides of the polynucleotide component 
5 of the composition into the subject under conditions that are compatible with 

expression of the polynucleotides in the subject for the production of encoded HIV 
immunogenic polypeptides, and administering the polypeptide component to the 
subject. The administering of the polynucleotide and polypeptide compositions may be 
concurrent or sequentially. In a preferred embodiment immunization with the 

10 polynucleotide component precedes immunization with the polypeptide component. 
Further, a single prime may be followed by multiple boosts, multiple primes may be 
followed by a single boost, multiple primes may be followed by multiple boosts, or a 
series of primes and boosts may be used. The polynucleotide component may 
comprise further components (e.g., components for enhancing immune response, 

15 carriers, etc.). The polypeptide component may comprise further components (e.g., 
components for enhancing immune response, carriers, etc.). 

Exemplary polynucleotide constructs, methods of making the polynucleotide 
constructs, corresponding polypeptide products, and methods of making polypeptides 
useful for HIV immunization have been previously described, for example, in the 

20 following PCT International Publication Nos.: WO/00/39302; WO/00/39303; 

WO/00/39304; WO/02/04493; WO/03/004657; WO/03/004620; and WO/03/020876. 

Although described generally with reference to HIV subtypes B and C as 
exemplary subtypes, the compositions and methods of the present invention are 
applicable to a wide variety of HIV subtypes, serotypes, or strains and immunogenic 

25 polypeptides encoded thereby, including but not limited to the previously identified 
HTV-l subtypes A through K, N and O, the identified CRFs (circulating recombinant 
forms), and HIV-2 strains and its subtypes. See, e.g., Myers, et al., Los Alamos 
Database, Los Alamos National Laboratory, Los Alamos, New Mexico; Myers, et al., 
Human Retroviruses and Aids, 1990, Los Alamos, New Mexico: Los Alamos National 

30 Laboratory. Further, the compositions and methods of the present invention may be 
used to raise broadly reactive neutralizing antibodies against viral strains and subtypes 
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that use the CCR5 coreceptor for cell entry (for example, both TV1 and SF162 use the 
CCR5 coreceptor (Example 4)). 

The polypeptide component of the present invention may comprise fragments 
of immunogenic polypeptide, for example, wherein the polypeptide sequence or a 
5 portion thereof contains an amino acid sequence of at least 3 to 5 amino acids, more 
preferably at least 8 to 10 amino acids, and even more preferably at least 15 to 20 
amino acids from a polypeptide encoded by the nucleic acid sequence. Also 
encompassed are polypeptide sequences that are immunologically identifiable with a 
polypeptide encoded by the sequence. Further, polyproteins can be constructed by 
10 fusing in- frame two or more polynucleotide sequences encoding polypeptide or 
peptide products. 

In addition, the polynucleotide component of the present invention may 
comprise one or more monocistronic expression cassettes comprising polynucleotides 
encoding immunogenic HIV polypeptides, or one or more polycistronic expression 

15 cassettes comprising polynucleotides encoding immunogenic HIV polypeptides, or 
combinations thereof. Polycistronic coding sequences may be produced, for example, 
by placing two or more polynucleotide sequences encoding polypeptide products 
adjacent each other, typically under the control of one promoter, wherein each 
polypeptide coding sequence may be modified to include sequences for internal 

20 ribosome binding sites. 

A variety of combinations of polynucleotides encoding immunogenic 
polypeptides (e.g., HIV immunogenic polypeptides) and immunogenic polypeptides or 
fragments thereof (e.g., HIV immunogenic polypeptides) can be used in the practice of 
the present invention. Polynucleotide sequences encoding immunogenic polypeptides 

25 can be included in a polynucleotide component of compositions of the present 

invention, for example, as DNA immunization constructs containing, for example, a 
synthetic Env expression cassettes, a synthetic Gag expression cassette, a synthetic 
pol-derived polypeptide expression cassette, a synthetic expression cassette 
comprising sequences encoding one or more accessory or regulatory genes (e.g., tat, 

30 rev, nef, vif, vpu, vpr). Immunogenic polypeptides may be included as purified 

polypeptides in the polypeptide component of compositions of the present invention. 



44 



PP21 100.001 
PATENT 



The immunogenic polypeptides may be synthetic or wild-type. In preferred 
embodiments the immunogenic polypeptides are antigenic viral proteins, or fragments 
thereof. 

5 2.2.0 Identification of Analogous Polypeptides and Polynucleotides 
Encoding Such Polypeptides 

The compositions and methods of the present invention are described with 
reference to exemplary HIV-1 sequences. The present invention is not limited to the 
sequences described herein. Numerous sequences for use in the practice of the present 

10 invention have been previously described (see, e.g., PCT International Publication 
Nos. WO/00/39302; WO/00/39303; WO/00/39304; WO/02/04493; WO/03/004657; 
WO/03/004620; and WO/03/020876.). Typically, the polynucleotide sequences used 
in the practice of the present invention encode polypeptides derived from a viral 
source (e.g., HIV-1). The polypeptides are typically derived from antigenic viral 

1 5 proteins, in particular, group specific antigen polypeptides, envelope polypeptides, 
capsid polypeptides, and other structural and non-structural polypeptides. The present 
invention is particularly described with reference to the use of envelope polypeptides 
and modifications thereof (and polynucleotides encoding same) derived from various 
subtypes, serotypes, or strains of the HIV-1 virus. Other HIV-1 polypeptides and 

20 polynucleotides encoding such polypeptides may be used in the practice of the present 
invention including, but not limited to, Gag, Pol (including Protease, Reverse 
Transcriptase, and Integrase), Tat, Rev, Nef, Vif, Vpr, and Vpu. 

The HIV genome and various polypeptide-encoding regions are shown in 
Table 1. The nucleotide positions are given relative to an HIV-1 Subtype C isolate 

25 from South Africa strain 8_5_TV1_C.ZA (Figures 1A-1D). However, it will be 
readily apparent to one of ordinary skill in the art in view of the teachings of the 
present disclosure how to determine corresponding regions in other HIV strains (from 
the same or different subtypes) or variants (e.g., isolates HTVni b , HIVsf2 5 HTV-1sfi62, 
HIV-1sfi7o, HP/lav, HIV LA i, HIVmn, HIV-1cm235„ HIV-1us4, other HIV-1 strains 

30 from diverse subtypes(e.g., subtypes, A through K, N and O), the identified CRFs 

(circulating recombinant forms), HIV-2 strains and diverse subtypes and strains (e.g., 
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HIV-2uci and HIV-2 UC 2), and simian immunodeficiency virus (SIV). (See, e.g., 
Virology, 3rd Edition (W.K. Joklik ed. 1988); Fundamental Virology, 2nd Edition 
(B.N. Fields and D.M. Knipe, eds. 1991); Virology, 3rd Edition (Fields, BN, DM 
Knipe, PM Howley, Editors, 1996, Lippincott-Raven, Philadelphia, PA; for a 
5 description of these and other related viruses), using for example, sequence 

comparison programs {e.g., BLAST and others described herein) or identification and 
alignment of structural features (e.g., a program such as the "ALB" program described 
herein that can identify the various regions). 



Regions of the HIV Genome relative to the Sequence of 8_5_TV1_C.ZA 



Region 


Position in nucleotide sequence 


5'LTR 


1-636 




1-457 


R 


458-553 


U5 


554-636 


NFkBII 


340-348 


NFkBI 


354-362 


Spl III 


379-388 


Spl II 


390-398 


Spl I 


400-410 


TATA Box 


429-433 


TAR 


474-499 


Poly A signal 


529-534 


PBS 


638-655 


p7 binding region, packaging signal 


685-791 


Gag: 


792-2285 


pl7 


792-1178 
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Region 


Position in nucleotide sequence 


p24 


1179-1871 


Cyclophilin A bdg. 


1395-1505 


MHR 


1632-1694 


p2 


1872-1907 


P7 


1908-2072 


Frameshift slip 


2072-2078 


pi 


2073-2120 


p6Gag 


2121-2285 


Zn-motifl 


1950-1991 


Zn-motifll 


2013-2054 


Pol: 


2072-5086 


p6Pol 


2072-2245 


Prot 


2246-2542 


p66RT 


2543-4210 


pl5RNaseH 


3857-4210 


p31Int 


4211-5086 


Vif: 


5034-5612 


Hydrophilic region 


5292-5315 


Vpr: 


5552-5839 


Oligomerization 


5552-5677 


Amphipathic a-helix 


5597-5653 


Tat: 


5823-6038 and 8417-8509 


Tat-1 exon 


joZJ-OUio 


Tat-2 exon 


8417-8509 


N-terminal domain 


5823-5885 


Trans-activation domain 


5886-5933 
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Region 


Position in nucleotide sequence 


Transduction domain 


5961-5993 


Rev: 


5962-6037 and 8416-8663 


Rev-1 exon 


5962-6037 


Rev-2 exon 


8416-8663 


High-affinity bdg. site 


8439-8486 


Leu-rich effector domain 


8562-8588 


Vpu: 


6060-6326 


Transmembrane domain 


6060-6161 


Cytoplasmic domain 


6162-6326 


Env (gpl60): 


6244-8853 


Signal peptide 


6244-6324 


gpl20 


6325-7794 


VI 


6628-6729 


V2 


6727-6852 


V3 


7150-7254 


V4 


7411-7506 


V5 


7663-7674 


CI 


6325-6627 


C2 


6853-7149 


C3 


7255-7410 


C4 


7507-7662 


C5 


7675-7794 


CD4 binding 


7540.-7566 


gp41 


/ /yb-ooji 


Fusion peptide 


7789-7842 


Oligomerization domain 


7924-7959 


N-terminal heptad repeat 


7921-8028 
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Region 


Position in nucleotide sequence 


C-terminal heptad repeat 


8173-8280 


Immunodominant region 


8023-8076 


Nef: 


8855-9478 


Myristoylation 


oof o OC7C 
OOJO-58 /J 


SH3 binding 


9062-9091 


Polypurine tract 


9128-9154 


SH3 binding 


9296-9307 



It will be readily apparent that one of skill in the art can align any HIV 
sequence to that shown in Table 1 to determine relative locations of any particular 
HIV gene. For example, using one of the alignment programs described herein (e.g., 
BLAST), other HIV genomic sequences can be aligned with 8_5_TV1_C.ZA (Table 
5 1) and locations of genes determined. Polypeptide sequences can be similarly aligned. 
For example, Figures 2A-2E shows the alignment of Env polypeptide sequences from 
various strains, relative to SF-162. As described in detail in PCT International 
Publication No. WO/00/39303, Env polypeptides (e.g., gpl20, gpl40 and gpl60) 
include a "bridging sheet" comprised of 4 anti-parallel beta-strands (beta-2, beta-3, 

10 beta -20 and beta -21) that form a beta -sheet. Extruding from one pair of the beta - 
strands (beta -2 and beta -3) are two loops, VI and V2. The beta -2 sheet occurs at 
approximately amino acid residue 113 (Cys) to amino acid residue 117 (Thr) while 
beta -3 occurs at approximately amino acid residue 192 (Ser) to amino acid residue 
194 (He), relative to SF-162. The "V1/V2 region" occurs at approximately amino acid 

15 positions 120 (Cys) to residue 189 (Cys), relative to SF-162. Extruding from the 
second pair of beta -strands (beta -20 and beta -21) is a "small-loop" structure, also 
referred to herein as "the bridging sheet small loop." The locations of both the small 
loop and bridging sheet small loop can be determined relative to HXB-2 following the 
teachings herein and in PCT International Publication No. WO/00/39303. Also 

20 shown by arrows in Figures 2A-2E are approximate sites for deletions sequence from 
the beta sheet region. The "*" denotes N-glycosylation sites that can be mutated 
following the teachings of the present specification. 
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2.3.0 Expression Cassettes Comprising Polynucleotide Sequences, 

Vectors, Polypeptides, Further Components, and Formulations 
Useful in the Practice of the Present Invention 

5 Compositions for the generation of immune responses of the present invention 

comprise a polynucleotide component and a polypeptide component. The 
polynucleotide component of may comprise one or more polynucleotides encoding 
immunogenic viral polypeptides. Such polynucleotides may comprise native viral 
sequences encoding immunogenic viral polypeptides or synthetic polynucleotides 

1 0 encoding immunogenic polypeptides. Synthetic polynucleotides may include 

sequence optimization to provide improved expression of the encoded polypeptides 
relative to the analogous native polynucleotide sequences. Further, synthetic 
polynucleotides may comprise mutations (single or multiple point mutations, missense 
mutations, nonsense mutations, deletions, insertions, etc.) relative to corresponding 

1 5 wild-type sequences. 

The polypeptide component of the compositions of the present invention may 
comprise one or more immunogenic viral polypeptide. Such polypeptides may 
comprise native immunogenic viral polypeptides or modified immunogenic 
polypeptides. Modified polypeptides may include sequence optimization to provide 

20 improved expression of the polypeptides relative to the analogous native 

polynucleotide sequences. Further, modified polypeptides may comprise mutations 
(single or multiple point mutations, missense mutations, nonsense mutations, 
deletions, insertions, etc.) relative to corresponding wild-type sequences. 

The compositions of the present invention, comprising a polynucleotide 

25 component and a polypeptide component, are described with reference to HIV-1 

derived sequences. However, the compositions and methods of the present invention 
are applicable to other types of viruses as well, wherein such viruses comprise 
multiple subtypes, serotypes, and/or strain variations, for example, including but not 
limited to other non-HIV retroviruses (e.g. HTLV-1, 2), hepadnoviruses (e.g. HBV), 

30 herpesviruses (e.g. HSV-1, 2, CMV, EBV, varizella-zoster, etc.), flaviviruses (e.g. 

HCV, Yellow fever, Tick borne encephalitis, St. Louis Encephalitis, West Nile Virus, 
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etc.), coronaviruses (e.g. SARS), paramyxoviruses (e.g., PIV, RSV, measles etc.), 
influenza viruses, picornaviruses, reoviruses (e.g., rotavirus), arenaviruses, 
rhabdoviruses, papovaviruses, parvoviruses, adenoviruses, Dengue virus, bunyaviruses 
(e.g. , hantavirus), calciviruses (e.g. Norwalk virus), filoviruses (e.g. , Ebola, 
5 Marburg). 

2.3.1 Modification of Polynucleotide Coding Sequences 

HIV-1 coding sequences, and related sequences, may be modified to have 
improved expression in target cells relative to the corresponding wild-type sequences. 
10 Following here are some exemplary modifications that can be made to such coding 
sequences. 

First, the HIV-1 codon usage pattern may be modified so that the resulting 
nucleic acid coding sequence are comparable to codon usage found in highly 
expressed human genes. The HIV codon usage reflects a high content of the 

15 nucleotides A or T of the codon-triplet. The effect of the HIV-1 codon usage is a high 
AT content in the DNA sequence that results in a decreased translation ability and 
instability of the mRNA. In comparison, highly expressed human codons prefer the 
nucleotides G or C. The HIV coding sequences may be modified to be comparable to 
codon usage found in highly expressed human genes. 

20 Second, there are inhibitory (or instability) elements (INS) located within the 

coding sequences of, for example, the Gag coding sequences. The RRE is a secondary 
RNA structure that interacts with the HIV encoded Rev-protein to overcome the 
expression down-regulating effects of the INS. To overcome the post-transcriptional 
activating mechanisms of RRE and Rev, the instability elements can be inactivated by 

25 introducing multiple point mutations that do not alter the reading frame of the encoded 
proteins. 

Third, for some genes the coding sequence has been altered such that the 
polynucleotide coding sequence encodes a gene product that is inactive or non- 
functional (e.g., inactivated polymerase, protease, tat, rev, nef, vif, vpr, and/or vpu 
30 gene products). Example 1 describes some exemplary mutations. 
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The synthetic coding sequences are assembled by methods known in the art, 
for example by companies such as the Midland Certified Reagent Company (Midland, 
Texas), following the guidance of the present specification. 

Some exemplary synthetic polynucleotide sequences encoding immunogenic 
5 HIV polypeptides and the polypeptides encoded thereby for use in the methods of the 
present invention have been described, for example, in PCT International Publication 
Nos. WO/00/39303, WO/00/39302, WO 00/39304, WO/02/04493, WO/03/020876, 
WO/03/004620, and WO/03/004657. 

In a preferred embodiment, the present invention relates to polynucleotides 

10 encoding Env polypeptides and corresponding Env polypeptides. For example, the 
codon usage pattern for Env may be modified so that the resulting nucleic acid coding 
sequence is comparable to codon usage found in highly expressed human genes. Such 
synthetic Env sequences are capable of higher level of protein production relative to 
the native Env sequences (see, for example, PCT International Publication Nos. 

15 WO/00/39302). Modification of the Env polypeptide coding sequences results in 
improved expression relative to the wild-type coding sequences in a number of 
mammalian cell lines (as well as other types of cell lines, including, but not limited to, 
insect cells). Similar Env polypeptide coding sequences can be obtained, modified 
and tested for improved expression from a variety of isolates. 

20 Further modifications of Env include, but are not limited to, generating 

polynucleotides that encode Env polypeptides having mutations and/or deletions 
therein. For instance, the hypervariable regions, VI and/or V2, can be deleted as 
described herein. In addition, the variable regions V3, V4 and/ or V5 can be modified 
or deleted. (See e.g, US 6,602,705) Additionally, other modifications, for example to 

25 the bridging sheet region and/or to N-glycosylation sites within Env can also be 

performed following the teachings of the present specification, (see, Figures 2A-2E, 
as well as PCT International Publication Nos. WO/00/39303, WO/00/39302, WO 
00/39304, WO/02/04493, WO/03/020876, and WO/03/004620). Other useful 
modifications of env are well known and include those described in Schulke et al., (J. 

30 Virol. 2002 76:7760), Yang et al. 2002, (J. Virol. 2002 76:4634), Yang et al. 2001 ( J. 
Virol. 2001 75:1 165), Shu et al. (Biochem. 1999 38:5378), Farzan et al. (J.Virol. 1998 
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72:7620) and Xiang et al. (J.Virol. 2002 76:9888). Various combinations of these 
modifications can be employed to generate synthetic expression cassettes and 
corresponding polypeptides as described herein. 

The present invention also includes expression cassettes which include 
5 synthetic sequences derived HIV genes other than Env, including but not limited to, 
regions within Gag, Env, Pol, as well as, tat, rev, nef, vif, vpr, and vpu. Further, the 
present invention includes synthetic polynucleotides and/or expression cassettes (as 
well as polypeptide encoded thereby) comprising two or more antigenic polypeptides. 
Such sequences may be used, for example, in their entirety or sequences encoding 

10 specific epitopes or antigens may be selected from the synthetic coding sequences 
following the teachings of the present specification and information known in the art. 
For example, the polypeptide sequences encoded by the polynucleotides may be 
subjected to computer analysis to predict antigenic peptide fragments within the full- 
length sequences. The corresponding polynucleotide coding fragments may then be 

1 5 used in the constructs of the present invention. Exemplary algorithms useful for such 
analysis include, but are not limited to, the following: 

AMPHI. This program has been used to predict T-cell epitopes (Gao, et al., 
(1989) J. Immunol. 143:3007; Roberts, et al, (1996) AIDS Res Hum Retrovir 12:593; 
Quakyi, et al., (1992) Scand J Immunol suppl. 11:9). The AMPHI algorithm is 

20 available int the Protean package of DNASTAR, Inc. (Madison, WI, USA). 

ANTIGENIC INDEX. This algorithm is useful for predicting antigenic 
determinants (Jameson & Wolf, (1998) CABIOS 4:181:186; Sherman, KE, et al., 
Hepatology 1996 Apr;23(4):688-94; Kasturi, KN, et al, J Exp Med 1995 Mar 
1;1 8 1(3): 1027-36; vanKampen V, et al., Mol Immunol 1994 Oct;3 1(1 5): 1 133-40; 

25 Ferroni P, et al., J Clin Microbiol 1993 Jun;31(6):1586-91; Beattie J, et al., Eur J 
Biochem 1992 Nov 15;210(l):59-66; Jones GL, et al, Mol Biochem Parasitol 1991 
Sep;48(l):l-9). 

HYDROPHILICITY. One algorithm useful for determining antigenic 
determinants from amino acid sequences was disclosed by Hopp & Woods (1981) 
30 (PNAS USA 78:3824-3828. 
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Default parameters, for the above-recited algorithms, may be used to determine 
antigenic sites. Further, the results of two or more of the above analyses may be 
combined to identify particularly preferred fragments. 

5 2.3.2 Further Modification of Polynucleotide Sequences And 
Polypeptides Encoded Thereby 

The immunogenic viral polypeptide-encoding expression cassettes described 
herein may also contain one or more further sequences encoding, for example, one or 
more transgenes. In one embodiment of the present invention, the polynucleotide 

10 component may comprise coding sequences for one or more HIV immunogenic 
polypeptides. Further, the polypeptide component may comprise one or more HIV 
immunogenic polypeptide. In a different embodiment of the present invention, a 
polynucleotide component may comprise coding sequences for one or more HIV 
immunogenic polypeptides, wherein the polynucleotide component further comprises 

15 a sequence encoding an additional antigenic polypeptide, with the proviso that the 
additional antigenic polypeptide is not an immunogenic polypeptide derived from an 
HIV-1 strain. Further, the polypeptide component may comprise one or more HTV 
immunogenic polypeptides, wherein the polypeptide component further comprises an 
additional antigenic polypeptide, with the proviso that the additional antigenic 

20 polypeptide is not an immunogenic polypeptide derived from an HIV-1 strain. 

Further sequences (e.g., transgenes) useful in the practice of the present 
invention include, but are not limited to, further sequences are those encoding further 
viral epitopes/antigens {including but not limited to, HCV antigens (e.g., El, E2; 
Houghton, M.., et al., U.S. Patent No. 5,714,596, issued February 3, 1998; Houghton, 

25 M.., et al., U.S. Patent No. 5,712,088, issued January 27, 1998; Houghton, M.., et al., 
U.S. Patent No. 5,683,864, issued November 4, 1997; Weiner, A.J., et al., U.S. Patent 
No. 5,728,520, issued March 17, 1998; Weiner, A.J., et al., U.S. Patent No. 5,766,845, 
issued June 16, 1998; Weiner, A.J., et al., U.S. Patent No. 5,670,152, issued 
September 23, 1997), HIV antigens (e.g., derived from one or more HIV isolate); and 

30 sequences encoding tumor antigens/epitopes. Further sequences may also be derived 
from non-viral sources, for instance, sequences encoding cytokines such interleukin-2 
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(IL-2), stem cell factor (SCF), interleukin 3 (IL-3), interleukin 6 (IL-6), interleukin 12 
(IL-12), G-CSF, granulocyte macrophage-colony stimulating factor (GM-CSF), 
interleukin- 1 alpha (IL-1 alpha), interleukin- 11 (IL-11), MIP-1, tumor necrosis factor 
(TNF), leukemia inhibitory factor (LIF), c-kit ligand, thrombopoietin (TPO) and flt3 
5 ligand, commercially available from several vendors such as, for example, Genzyme 
(Framingham, MA), Genentech (South San Francisco, CA), Amgen (Thousand Oaks, 
CA), R&D Systems and Immunex (Seattle, WA). Additional sequences are described 
herein below. 

HIV polypeptide coding sequences can be obtained from other HIV isolates, 
10 see, e.g., Myers et al. Los Alamos Database, Los Alamos National Laboratory, Los 
Alamos, New Mexico (1992); Myers et al., Human Retroviruses and Aids, 1997, Los 
Alamos, New Mexico: Los Alamos National Laboratory. Synthetic expression 
cassettes can be generated using such coding sequences as starting material by 
following the teachings of the present specification. 
15 Further, the synthetic expression cassettes of the present invention include 

related polypeptide sequences having greater than 85%, preferably greater than 90%, 
more preferably greater than 95%, and most preferably greater than 98% sequence 
identity to the polypeptides encoded by the synthetic expression cassette sequences 
disclosed herein. 

20 Exemplary expression cassettes and modifications are set forth in Example 1 

and are discussed further herein below. 

Further, the polynucleotides of the present invention.may comprise alternative 
polymer backbone structures such as, but not limited to, polyvinyl backbones (Pitha, 
Biochem Biophys Acta, 204:39, 1970a; Pitha, Biopolymers, 9:965, 1970b), and 

25 morpholino backbones (Summerton, J., et al, U.S. Patent No. 5,142,047, issued 

08/25/92; Summerton, J., et al, U.S. Patent No. 5,185,444 issued 02/09/93). A variety 
of other charged and uncharged polynucleotide analogs have been reported. 
Numerous backbone modifications are known in the art, including, but not limited to, 
uncharged linkages {e.g. , methyl phosphonates, phosphotriesters, phosphoamidates, 

30 and carbamates) and charged linkages (e.g., phosphorothioates and 
phosphorodithioates. 
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2.3.3 Exemplary Cloning Vectors and Systems for Use with the 

Polynucleotide Sequences Encoding Immunogenic Polypeptides 

Polynucleotide sequences for use in the compositions and methods of the 
5 present invention can be obtained using recombinant methods, such as by screening 
cDNA and genomic libraries from cells expressing the gene, or by deriving the gene 
from a vector known to include the same. Furthermore, the desired gene can be 
isolated directly from cells and tissues containing the same, using standard techniques, 
such as phenol extraction and PCR of cDNA or genomic DNA. See, e.g., Sambrook 

10 et al., supra, for a description of techniques used to obtain and isolate DNA. The gene 
of interest can also be produced synthetically, rather than cloned. The nucleotide 
sequence can be designed with the appropriate codons for the particular amino acid 
sequence desired. In general, one will select preferred codons for the intended host in 
which the sequence will be expressed. The complete sequence is assembled from 

15 overlapping oligonucleotides prepared by standard methods and assembled into a 
complete coding sequence. See, e.g., Edge, Nature (1981) 292:756; Nambair et al., 
Science (1984) 223:1299; Jay et al.,./. Biol. Chem. (1984) 259:6311; Stemmer, 
W.P.C., (1995) Gene 164:49-53. 

Next, the gene sequence encoding the desired antigen can be inserted into a 

20 vector containing a synthetic expression cassette of the present invention. In one 
embodiment, polynucleotides encoding selected antigens are separately cloned into 
expression vectors (e.g., a first Env-coding polynucleotide in a first vector, a second 
analogous Env-coding polynucleotide in a second vector). In certain embodiments, 
the antigen is inserted into or adjacent a synthetic Gag coding sequence such that 

25 when the combined sequence is expressed it results in the production of VLPs 
comprising the Gag polypeptide and the antigen of interest, e.g., Env (native or 
modified) or other antigen(s) (native or modified) derived from HIV. Insertions can 
be made within the coding sequence or at either end of the coding sequence (5', amino 
terminus of the expressed Gag polypeptide; or 3', carboxy terminus of the expressed 

30 Gag polypeptide)(Wagner, R, et al., Arch Virol. 127:117-137, 1992; Wagner, R., et 
al., Virology 200:162-175, 1994; Wu, X., et al., /. Virol. 69(6):3389-3398, 1995; 
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Wang, C-T., et al., Virology 200:524-534, 1994; Chazal, N., et al., Virology 68(1):1 1 1- 
122, 1994; Griffiths, J.C., et al., J. Virol 67(6):3 191-3 198, 1993; Reicin, A.S., et al., 
J. Virol. 69(2):642-650, 1995). Up to 50% of the coding sequences of p55Gag can be 
deleted without affecting the assembly to virus-like particles and expression efficiency 
5 (Borsetti, A., et al, J. Virol. 72(1 1):93 1 3-93 1 7, 1 998; Gamier, L., et al., J Virol 

1H6YA661-A611, 1998; Zhang, Y., et al., J Virol 72(3): 1782- 1789, 1998; Wang, C, 
et al., J Virol 72(10): 7950-7959, 1998). When sequences are added to the amino 
terminal end of Gag, the polynucleotide can contain coding sequences at the 5' end 
that encode a signal for addition of a myristic moiety to the Gag-containing 

1 0 polypeptide (e.g., sequences that encode Met-Gly). 

Expression cassettes for use in the practice of the present invention can also 
include control elements operably linked to the coding sequence that allow for the 
expression of the gene in vivo in the subject species. For example, typical promoters 
for mammalian cell expression include the SV40 early promoter, a CMV promoter 

1 5 such as the CMV immediate early promoter, the mouse mammary tumor virus LTR 
promoter, the adenovirus major late promoter (Ad MLP), and the herpes simplex virus 
promoter, among others. Other nonviral promoters, such as a promoter derived from 
the murine metallothionein gene, will also find use for mammalian expression. 
Typically, transcription termination and polyadenylation sequences will also be 

20 present, located 3' to the translation stop codon. Preferably, a sequence for 

optimization of initiation of translation, located 5' to the coding sequence, is also 
present. Examples of transcription terminator/polyadenylation signals include those 
derived from SV40, as described in Sambrook et al., supra, as well as a bovine growth 
hormone terminator sequence. 

25 Enhancer elements may also be used herein to increase expression levels of the 

mammalian constructs. Examples include the SV40 early gene enhancer, as described 
in Dijkema et al., EMBO J. (1985) 4:761, the enhancer/promoter derived from the long 
terminal repeat (LTR) of the Rous Sarcoma Virus, as described in Gorman et al., Proc. 
Natl. Acad. Sci. USA (1982b) 79:6777 and elements derived from human CMV, as 

30 described in Boshart et al., Cell (1985) 41:521, such as elements included in the CMV 
intron A sequence. 
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Furthermore, plasmids can be constructed which include a chimeric antigen- 
coding gene sequences, encoding, e.g., multiple antigens/epitopes of interest, for 
example derived from more than one viral isolate. 

Typically the antigen coding sequences precede or follow the synthetic coding 
5 sequence and the chimeric transcription unit will have a single open reading frame 
encoding both the antigen of interest and the synthetic coding sequences. 
Alternatively, multi-cistronic cassettes (e.g., bi-cistronic cassettes) can be constructed 
allowing expression of multiple antigens from a single mRNA using the EMCV IRES, 
or the like. 

10 In one embodiment of the present invention, the polynucleotide component of 

an immune generating composition may comprise, for example, the following: a first 
expression vector comprising a first Env expression cassette, wherein the Env coding 
sequence is derived from a first HIV subtype, serotype, or strain, and a second 
expression vector comprising a second Env expression cassette, wherein the Env 

1 5 coding sequence is derived from a second HIV subtype, serotype, or strain. 

Expression cassettes comprising coding sequences of the present invention may be 
combined in any number of combinations depending on the coding sequence products 
(e.g., HIV polypeptides) to which, for example, an immunological response is desired 
to be raised. In yet another embodiment, synthetic coding sequences for multiple 

20 HIV-derived polypeptides may be constructed into a polycistronic message under the 
control of a single promoter wherein IRES are placed adjacent the coding sequence for 
each encoded polypeptide. 

Exemplary polynucleotide sequences of interest for use in the present 
invention may be derived from strains including, but not limited to: subtype B-SF162, 

25 subtype C-TV1 .8_2 (8_2_TV1_C.ZA), subtype C-TV1 .8_5 (8_5_TV1_C.ZA), 
subtype C-TV2.12-5/1 (12-5_1_TV2_C.ZA), subtype C-MJ4, India subtype C- 
93IN101, subtype A-Q2317, subtype D-92UG001, subtype E-cm235, subtype A HIV- 
1 isolate Q23-17 from Kenya GenBank Accession AF004885, subtype A HIV-1 
isolate 98UA01 16 from Ukraine GenBank Accession AF413987, subtype A HIV-1 

30 isolate SE8538 from Tanzania GenBank Accession AF069669, subtype A Human 
immunodeficiency virus 1 proviral DNA, complete genome, clone:pUG031-Al 
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GenBank Accession AB098330, subtype D Human immunodeficiency virus type 1 
complete proviral genome, strain 92UG001 GenBank Accession AJ320484, subtype D 
HIV-1 isolate 94UG1 14 from Uganda GenBank Accession U88824, subtype D 
Human immunodeficiency virus type 1, isolate ELIGenBank Accession K03454, and 
5 Indian subtype C Human immunodeficiency virus type 1 subtype C genomic RNA 
GenBank Accession AB023804. 

Polynucleotide coding sequences used in the present invention may encode 
functional gene products or be mutated to reduce (relative to wild-type), attenuate, 
inactivate, eliminate, or render non-functional the activity of the gene product(s) 

10 encoded the synthetic polynucleotide. 

Once complete, the expression cassettes are typically used in constructs for 
nucleic acid immunization using standard gene delivery protocols. Methods for gene 
delivery are known in the art. See, e.g., U.S. Patent Nos. 5,399,346, 5,580,859, 
5,589,466. Genes can be delivered either directly to the vertebrate subject or, 

15 alternatively, delivered ex vivo, to cells derived from the subject and the cells 
reimplanted in the subject. 

A number of viral based systems have been developed for gene transfer into 
mammalian cells. Selected sequences can be inserted into a vector and packaged in 
retroviral particles using techniques known in the art. The recombinant virus can then 

20 be isolated and delivered to cells of the subject either in vivo or ex vivo. A number of 
viral based systems have been developed for use as gene transfer vectors for 
mammalian host cells. For example, retroviruses (in particular, lentiviral vectors) 
provide a convenient platform for gene delivery systems. A coding sequence of 
interest (for example, a sequence useful for gene therapy applications) can be inserted 

25 into a gene delivery vector and packaged in retroviral particles using techniques 

known in the art. Recombinant virus can then be isolated and delivered to cells of the 
subject either in vivo or ex vivo. A number of retroviral systems have been described, 
including, for example, the following: (U.S. Patent No. 5,219,740; Miller et al. (1989) 
BioTechniques 7:980; Miller, A.D. (1990) Human Gene Therapy 1:5; Scarpa et al. 

30 (1991) Virology 180:849; Burns et al. (1993) Proc. Natl. Acad. Sci. USA 90:8033; 
Boris-Lawrie et al. (1993) Cur. Opin. Genet. Develop. 3:102; GB 2200651; EP 
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0415731; EP 0345242; PCT International Publication No. WO 89/02468; PCT 
International Publication No. WO 89/05349; PCT International Publication No. WO 
89/09271; PCT International Publication No. WO 90/02806; PCT International 
Publication No. WO 90/07936; PCT International Publication No. WO 90/07936; PCT 
5 International Publication No. WO 94/03622; PCT International Publication No. WO 
93/25698; PCT International Publication No. WO 93/25234; PCT International 
- Publication No. WO 93/1 1230; PCT International Publication No. WO 93/10218; PCT 
International Publication No. WO 91/02805; in U.S. 5,219,740; U.S. 4,405,712; U.S. 
4,861,719; U.S. 4,980,289 and U.S. 4,777,127; in U.S. Serial No. 07/800,921; and in 

10 Vile (1993) Cancer Res 53:3860-3864; Vile (1993) Cancer Res 53:962-967; Ram 
(1993) Cancer Res 53:83-88; Takamiya (1992) JNeurosci Res 33:493-503; Baba 
(1993) JNeurosurg 79:729-735; Mann (1983) Cell 33:153; Cane (1984) Proc Natl 
Acad Sci USA 81_;6349; and Miller (1990) Human Gene Therapy \. 

One type of retrovirus, the murine leukemia virus, or "MLV", has been widely 

15 utilized for gene therapy applications (see generally Mann et al. (Cell 33:153, 1993), 
Cane and Mulligan (Proc, Nat 7. Acad. Sci. USA 81:6349, 1984), and Miller et al., 
Human Gene Therapy 1:5-14,1990. 

Lentiviral vectors may be readily constructed from a wide variety of 
lentiviruses (see RNA Tumor Viruses, Second Edition, Cold Spring Harbor 

20 Laboratory, 1985). Representative examples of lentiviruses included HIV, HIV- 1, 
HIV-2, FIV and SIV. Such lentiviruses may either be obtained from patient isolates, 
or, more preferably, from depositories or collections such as the American Type 
Culture Collection, or isolated from known sources using available techniques. 
Portions of the lentiviral gene delivery vectors (or vehicles) may be derived from 

25 different viruses. For example, in a given recombinant lentiviral vector, LTRs may be 
derived from an HIV, a packaging signal from SIV, and an origin of second strand 
synthesis from HrV-2. Lentiviral vector constructs may comprise a 5' lentiviral LTR, 
a tRNA binding site, a packaging signal, one or more heterologous sequences, an 
origin of second strand DNA synthesis and a 3' LTR. The lentiviral vectors have a 

30 nuclear transport element that, in preferred embodiments is not RRE. Representative 
examples of suitable nuclear transport elements include the element in Rous sarcoma 
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virus (Ogert, et al., J ViroL 70, 3834-3843, 1996), the element in Rous sarcoma virus 
(Liu & Mertz, Genes & Dev., 9, 1766-1789, 1995) and the element in the genome of 
simian retrovirus type I (Zolotukhin, et al., J Virol. 68, 7944-7952, 1994). Other 
potential elements include the elements in the histone gene (Kedes, Annu. Rev. 
5 Biochem. 48, 837-870, 1970), interferon gene (Nagata et al., Nature 287, 401-408, 
1980), adrenergic receptor gene (Koilka, et al., Nature 329, 75-79, 1987), and the c- 
Jun gene (Hattorie, et al., Proc. Natl. Acad. Sci. USA 85, 9148-9152, 1988). 

A number of adenovirus vectors have also been described. Unlike retroviruses 
which integrate into the host genome, adenoviruses persist extrachromosomally thus 

10 minimizing the risks associated with insertional mutagenesis (Haj-Ahmad and 
Graham, J. Virol. (1986) 57:267-274; Bett et al., J. Virol. (1993) 67:591 1-5921; 
Mittereder et al., Human Gene Therapy (1994) 5:717-729; Seth et al., J. Virol. (1994) 
68:933-940; Barr et al., Gene Therapy (1994) 1:51-58; Berkner, K.L. BioTechniques 
(1988) 6:616-629; and Rich et al., Human Gene Therapy (1993) 4:461-476). 

15 Additionally, various adeno-associated virus (AAV) vector systems have been 

developed for gene delivery. AAV vectors can be readily constructed using 
techniques well known in the art. See, e.g., U.S. Patent Nos. 5,173,414 and 5,139,941; 
- PCT International Publication Nos. WO 92/01070 (published 23 January 1 992) and 
WO 93/03769 (published 4 March 1993); Lebkowski et al., Molec. Cell. Biol. (1988) 

20 8 :3988-3996; Vincent et al, Vaccines 90 (1990) (Cold Spring Harbor Laboratory 

Press); Carter, B.J. Current Opinion in Biotechnology (1992) 3:533-539; Muzyczka, 
N. Current Topics in Microbiol, and Immunol. (1992) 158:97-129; Kotin, R.M. 
Human Gene Therapy (1994) 5:793-801; Shelling and Smith, Gene Therapy (1994) 
1:165-169; and Zhou et al., J. Exp. Med. (1994) 179:1867-1875. 

25 Another vector system useful for delivering the polynucleotides of the present 

invention is the enterically administered recombinant poxvirus vaccines described by 
Small, Jr., P.A., et al. (U.S. Patent No. 5,676,950, issued October 14, 1997). 

Additional viral vectors that will find use for delivering the nucleic acid 
molecules encoding the antigens of interest include those derived from the pox family 

30 of viruses, including vaccinia virus and avian poxvirus. By way of example, vaccinia 
virus recombinants expressing the genes can be constructed as follows. The DNA 
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encoding the particular immunogenic HIV polypeptide coding sequence is first 
inserted into an appropriate vector so that it is adjacent to a vaccinia promoter and 
flanking vaccinia DNA sequences, such as the sequence encoding thymidine kinase 
(TK). This vector is then used to transfect cells that are simultaneously infected with 
5 vaccinia. Homologous recombination serves to insert the vaccinia promoter plus the 
gene encoding the coding sequences of interest into the viral genome. The resulting 
TK'recombinant can be selected by culturing the cells in the presence of 5- 
bromodeoxyuridine and picking viral plaques resistant thereto. 

Alternatively, avipoxviruses, such as the fowlpox and canarypox viruses, can 

10 also be used to deliver the genes. Recombinant avipox viruses, expressing 

immunogens from mammalian pathogens, are known to confer protective immunity 
when administered to non-avian species. The use of an avipox vector is particularly 
desirable in human and other mammalian species since members of the avipox genus 
can only productively replicate in susceptible avian species and therefore are not 

15 infective in mammalian cells. Methods for producing recombinant avipoxviruses are 
known in the art and employ genetic recombination, as described above with respect 
to the production of vaccinia viruses. See, e.g., PCT International Publication Nos. 
WO 91/12882; WO 89/03429; and WO 92/03545. 

Molecular conjugate vectors, such as the adenovirus chimeric vectors 

20 described in Michael et al., /. Biol. Chem. (1993) 268:6866-6869 and Wagner et al., 
Proc. Natl. Acad. Sci. USA (1992) 89:6099-6103, can also be used for gene delivery. 

Members of the Alphavirus genus, such as, but not limited to, vectors derived 
from the Sindbis, Semliki Forest, and Venezuelan Equine Encephalitis viruses, will 
also find use as viral vectors for delivering the polynucleotides of the present 

25 invention (for example, first and second synthetic gpl40-polypeptide encoding 

expression cassette, wherein the first and second gpl40 polypeptides are analogous 
and derived from different HIV subtypes, serotypes, or strains). For a description of 
Sindbis-virus derived vectors useful for the practice of the instant methods, see, 
Dubensky et al., J. Virol. (1996) 70:508-519; and PCT International Publication Nos. 

30 WO 95/07995 and WO 96/17072; as well as, Dubensky, Jr., T.W., et al., U.S. Patent 
No. 5,843,723, issued December 1, 1998, and Dubensky, Jr., T.W., U.S. Patent No. 
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5,789,245, issued August 4, 1998. Preferred expression systems include, but are not 
limited to, eucaryotic layered vector initiation systems (e.g., US Patent No. 6,015,686, 
US Patent No. 5, 814,482, US Patent No. 6,015,694, US Patent No. 5,789,245, EP 
1029068A2, PCT International Publication No. WO 9918226A2/A3, EP 00907746A2, 
5 PCT International Publication No. WO 9738087A2). Exemplary expression systems 
include, but are not limited to, chimeric alphavirus replicon particles, for example, 
those that form VEE and SIN (see, e.g., Perri, et al., "An alphavirus replicon particle 
chimera derived from Venezuelan equine encephalitis and Sindbis viruses is a potent 
gene-based vaccine delivery vector," J. Virol 2003, 77(19), in press; PCT 

10 WO02/099035; USSN 10/3 10734, filed Dec 4 2002). Such alphavirus-based vector 
systems can be used in a prime or as a boost in DNA-primed subjects or potentially as 
a stand-alone immunization method for the induction of neutralizing antibodies using 
the multivalent approaches described herein. 

Expression cassette delivery vectors may also include tissue-specific promoters 

15 to drive expression of one or more genes or sequences of interest. 

Expression cassette delivery vector constructs may be generated such that 
more than one gene of interest is expressed. This may be accomplished through the 
use of di- or oligo-cistronic cassettes (e.g., where the coding regions are separated by 
80 nucleotides or less, see generally Levin et al., Gene 108:167-174, 1991), or through 

20 the use of Internal Ribosome Entry Sites ("IRES"). 

Synthetic expression cassettes of interest can also be delivered without a viral 
vector. For example, delivery of the expression cassettes of the present invention can 
also be accomplished using eucaryotic expression vectors comprising CMV-derived 
elements, such vectors include, but are not limited to, the following: pCMVKm2, 

25 pCMV-link pCMVPLEdhfr, and pCMV6a (see Example 1). For example, a synthetic 
DNA expression cassette of the present invention, e.g., one encoding gpl40 
polypeptide, may be cloned into the following eucaryotic expression vectors: 
pCMVKm2, for transient expression assays and DNA immunization studies, the 
pCMVKm2 vector is derived from pCMV6a (Chapman et al., Nuc. Acids Res. (1991) 

30 19:3979-3986) and comprises a kanamycin selectable marker, a ColEl origin of 

replication, a CMV promoter enhancer and Intron A, followed by an insertion site for 
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the synthetic sequences described below followed by a polyadenylation signal derived 
from bovine growth hormone ~ the pCMVKm2 vector differs from the pCMV-link 
vector only in that a polylinker site is inserted into pCMVKm2 to generate pCMV- 
link; pESN2dhfr and pCMVPLEdhfr, for expression in Chinese Hamster Ovary 
5 (CHO) cells; and, pAcC13, a shuttle vector for use in the Baculo virus expression 

system (pAcC13, is derived from pAcC12 which is described by Munemitsu S., et al., 
Mol Cell Biol. 10(ll):5977-5982, 1990). 

In additon, the expression cassettes of the present invention can be packaged in 
liposomes prior to delivery to the subject or to cells derived therefrom. Lipid 

10 encapsulation is generally accomplished using liposomes which are able to stably bind 
or entrap and retain nucleic acid. The ratio of condensed DNA to lipid preparation can 
vary but will generally be around 1:1 (mg DNA:micromoles lipid), or more of lipid. 
For a review of the use of liposomes as carriers for delivery of nucleic acids, see, Hug 
and Sleight, Biochim. Biophys. Acta. (1991) 1097 :1-17; Straubinger et al., in Methods 

15 ofEnzymology (1983), Vol. 101, pp. 512-527. 

Liposomal preparations for use in the present invention include cationic 
(positively charged), anionic (negatively charged) and neutral preparations, with 
cationic liposomes particularly preferred. Cationic liposomes have been shown to 
mediate intracellular delivery of plasmid DNA (Feigner et al., Proc. Natl. Acad. Sci. 

20 USA (1987) 84:7413-7416); mRNA (Malone et al., Proc. Natl. Acad. Sci. USA (1989) 
86:6077-6081); and purified transcription factors (Debs et al., J. Biol: Chem. (1990) 
265:10189-10192), in functional form. 

Cationic liposomes are readily available. For example, N[ 1-2,3- 
dioleyloxy)propyl]-N,N,N-triethylammonium (DOTMA) liposomes are available 

25 under the trademark Lipofectin, from GIBCO BRL, Grand Island, NY. (See, also, 

Feigner et al., Proc. Natl. Acad. Sci. USA (1987) 84:7413-7416). Other commercially 
available lipids include (DDAB/DOPE) and DOTAP/DOPE (Boerhinger). Other 
cationic liposomes can be prepared from readily available materials using techniques 
well known in the art. See, e.g., Szoka et al., Proc. Natl. Acad. Sci. USA (1978) 

30 75:4194-4198; PCT International Publication No. WO 90/1 1092 for a description of 
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the synthesis of DOTAP (l,2-bis(oleoyloxy)-3-(trimethylammonio)propane) 
liposomes. 

Similarly, anionic and neutral liposomes are readily available, such as, from 
Avanti Polar Lipids (Birmingham, AL), or can be easily prepared using readily 
5 available materials. Such materials include phosphatidyl choline, cholesterol, 
phosphatidyl ethanolamine, dioleoylphosphatidyl choline (DOPC), 
dioleoylphosphatidyl glycerol (DOPG), dioleoylphoshatidyl ethanolamine (DOPE), 
among others. These materials can also be mixed with the DOTMA and DOTAP 
starting materials in appropriate ratios. Methods for making liposomes using these 

10 materials are well known in the art. 

The liposomes can comprise multilammelar vesicles (MLVs), small 
unilamellar vesicles (SUVs), or large unilamellar vesicles (LUVs). The various 
liposome-nucleic acid complexes are prepared using methods known in the art. See, 
e.g., Straubinger et al., in METHODS OF IMMUNOLOGY (1983), Vol. 101, pp. 512- 

15 527; Szoka et al., Proc. Natl. Acad. Sci. USA (1978) 75:4194-4198; Papahadjopoulos 
et al., Biochim. Biophys. Acta (1975) 394:483; Wilson et al, Cell (1979) 17:77); 
Deamer and Bangham, Biochim. Biophys. Acta (1976) 443:629; Ostro et al., Biochem. 
Biophys. Res. Commun. (1977) 76:836; Fraley et al., Proc. Natl. Acad. Sci. USA 
(1979) 76:3348); Enoch and Strittmatter, Proc. Natl. Acad. Sci. USA (1979) 76:145); 

20 Fraley et al., /. Biol. Chem. (1980) 255:10431; Szoka and Papahadjopoulos, Proc. 
Natl. Acad. Sci. USA (1978) 75:145; and Schaefer-Ridder et al., Science (1982) 
215:166. 

The DNA and/or protein antigen(s) can also be delivered in cochleate lipid 
compositions similar to those described by Papahadjopoulos et al., Biochem. Biophys. 
25 Acta. (1975) 394:483-491. See, also, U.S. Patent Nos. 4,663,161 and 4,871,488. 

The expression cassettes of interest may also be encapsulated, adsorbed to, or 
associated with, particulate carriers. Such carriers present multiple copies of a 
selected antigen to the immune system and promote trapping and retention of antigens 
in local lymph nodes. The particles can be phagocytosed by macrophages and can 
30 enhance antigen presentation through cytokine release. Examples of particulate 
carriers include those derived from polymethyl methacrylate polymers, as well as 
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microparticles derived from poly(lactides) and poly(lactide-co-glycolides), known as 
PLC See, e.g., Jeffery et al., Pharm. Res. (1993) 10:362-368; McGee JP, et al., J 
Microencapsul. 14(2): 197-2 10, 1997; O'Hagan DT, et al., Vaccine 11 (2): 149-54, 
1993. Suitable microparticles may also be manufactured in the presence of charged 
5 detergents, such as anionic or cationic detergents, to yield microparticles with a 
surface having a net negative or a net positive charge. For example, microparticles 
manufactured with anionic detergents, such as hexadecyltrimethylammonium bromide 
(CTAB), i.e. CTAB-PLG microparticles, adsorb negatively charged macromolecules, 
such as DNA. (see, e.g., Int'l Application Number PCT/US99/17308). 

10 Furthermore, other particulate systems and polymers can be used for the in 

vivo or ex vivo delivery of the gene of interest. For example, polymers such as 
polylysine, polyarginine, polyornithine, spermine, spermidine, as well as conjugates of 
these molecules, are useful for transferring a nucleic acid of interest. Similarly, DEAE 
dextran-mediated transfection, calcium phosphate precipitation or precipitation using 

15 other insoluble inorganic salts, such as strontium phosphate, aluminum silicates 

including bentonite and kaolin, chromic oxide, magnesium silicate, talc, and the like, 
will find use with the present methods. See, e.g., Feigner, P.L., Advanced Drug 
Delivery Reviews (1990) 5:163-187, for a review of delivery systems useful for gene 
transfer. Peptoids (Zuckerman, R.N., et al., U.S. Patent No. 5,831,005, issued 

20 November 3, 1998) may also be used for delivery of a construct of the present 
invention. 

In some embodiments of the present invention, alum and PLG are useful 
delivery adjuvants that enhance immunity to polynucleotide vaccines (e.g., DNA 
vaccines). Further embodiments include, but are not limited to, toxoids, cytokines, 

25 and co-stimulatory molecules may also be used as genetic adjuvants with 
polynucleotide vaccines. 

Additionally, biolistic delivery systems employing particulate carriers such as 
gold and tungsten, are especially useful for delivering synthetic expression cassettes of 
the present invention. The particles are coated with the synthetic expression 

30 cassette(s) to be delivered and accelerated to high velocity, generally under a reduced 
atmosphere, using a gun powder discharge from a "gene gun." For a description of 
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such techniques, and apparatuses useful therefore, see, e.g., U.S. Patent Nos. 
4,945,050; 5,036,006; 5,100,792; 5,179,022; 5,371,015; and 5,478,744. Also, needle- 
less injection systems can be used (Davis, H.L., et al, Vaccine 12:1503-1509, 1994; 
Bioject, Inc., Portland, OR). 
5 Recombinant vectors carrying a synthetic expression cassette of the present 

invention are formulated into compositions for delivery to the vertebrate subject. 
These compositions may either be prophylactic (to prevent infection) or therapeutic (to 
treat disease after infection). If prevention of disease is desired, the compositions are 
generally administered prior to primary infection with the pathogen of interest. If 

10 treatment is desired, e.g., the reduction of symptoms or recurrences, the compositions 
are generally administered subsequent to primary infection. The compositions will 
comprise a "therapeutically effective amount" of the gene of interest such that an 
amount of the antigen can be produced in vivo so that an immune response is 
generated in the individual to which it is administered. The exact amount necessary 

1 5 will vary depending on the subject being treated; the age and general condition of the 
subject to be treated; the capacity of the subject's immune system to synthesize 
antibodies; the degree of protection desired; the severity of the condition being treated; 
the particular antigen selected and its mode of administration, among other factors, 
An appropriate effective amount can be readily determined by one of skill in the art. 

20 Thus, a "therapeutically effective amount" will fall in a relatively broad range that can 
be determined through routine trials. 

The compositions will generally include one or more "pharmaceutically 
acceptable excipients or vehicles" such as water, saline, glycerol, polyethyleneglycol, 
hyaluronic acid, ethanol, etc. Additionally, auxiliary substances, such as wetting or 

25 emulsifying agents, pH buffering substances, and the like, may be present in such 
vehicles. Certain facilitators of nucleic acid uptake and/or expression can also be 
included in the compositions or coadministered, such as, but not limited to, 
bupivacaine, cardiotoxin and sucrose. 

Once formulated, the compositions of the invention can be administered 

30 directly to the subject (e.g., as described above) or, alternatively, delivered ex vivo, to 
cells derived from the subject, using methods such as those described above. For 
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example, methods for the ex vivo delivery and reimplantation of transformed cells into 
a subject are known in the art and can include, e.g., dextran-mediated transfection, 
calcium phosphate precipitation, polybrene mediated transfection, lipofectamine and 
LT-1 mediated transfection, protoplast fusion, electroporation, encapsulation of the 
5 polynucleotide(s) (with or without the corresponding antigen) in liposomes, and direct 
microinjection of the DNA into nuclei. 

Direct delivery of synthetic expression cassette compositions in vivo will 
generally be accomplished with or without viral vectors, as described above, by 
injection using either a conventional syringe or a gene gun, such as the Accell® gene 

10 delivery system (PowderJect Technologies, Inc., Oxford, England). The constructs 
can be injected either subcutaneously, epidermally, intradermally, intramucosally such 
as nasally, rectally and vaginally, intraperitoneally, intravenously, orally or 
intramuscularly. Delivery of DNA into cells of the epidermis is particularly preferred 
as this mode of administration provides access to skin-associated lymphoid cells and 

15 provides for a transient presence of DNA in the recipient. Other modes of 

administration include oral and pulmonary administration, suppositories, needle-less 
injection, transcutaneous and transdermal applications. Dosage treatment may be a 
single dose schedule or a multiple dose schedule. Administration of polypeptides 
encoding immunogenic polypeptides is combined with administration of analogous 

20 immunogenic polypeptides following the methods of the present invention. 

2.3.4 Expression of Synthetic Sequences Encoding HIV-1 Polypeptides 
and Related Polypeptides 

Immunogenic viral polypeptide-encoding sequences of the present invention 
25 can be cloned into a number of different expression vectors/host cell systems to 

provide immunogenic polypeptides for the polypeptide component of the immune- 
response generating compositions of the present invention. For example, DNA 
fragments encoding HIV polypeptides can be cloned into eucaryotic expression 
vectors, including, a transient expression vector, CMV -promoter-based mammalian 
30 vectors, and a shuttle vector for use in baculovirus expression systems. Synthetic 

polynucleotide sequences (e.g., codon optimized polynucleotide sequences) and wild- 
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type sequences can typically be cloned into the same vectors. Numerous cloning 
vectors are known to those of skill in the art, and the selection of an appropriate 
cloning vector is a matter of choice. See, generally, Sambrook et al, supra. The 
vector is then used to transform an appropriate host cell. Suitable recombinant 
5 expression systems include, but are not limited to, bacterial, mammalian, 

baculovirus/insect, vaccinia, Semliki Forest virus (SFV), Alphaviruses (such as, 
Sindbis, Venezuelan Equine Encephalitis (VEE)), mammalian, yeast and Xenopus 
expression systems, well known in the art. Particularly preferred expression systems 
are mammalian cell lines, vaccinia, Sindbis, eucaryotic layered vector initiation 

10 systems (e.g., US Patent No. 6,015,686, US Patent No. 5, 814,482, US Patent No. 
6,015,694, US Patent No. 5,789,245, EP 1029068A2, PCT International Publication 
No. WO 9918226A2/A3, EP 00907746A2, PCT International Publication No. WO 
9738087A2), insect and yeast systems. 

A number of host cells for such expression systems are also known in the art. 

15 For example, mammalian cell lines are known in the art and include immortalized cell 
lines available from the American Type Culture Collection (A.T.C.C.), such as, but 
not limited to, Chinese hamster ovary (CHO) cells, HeLa cells, baby hamster kidney 
(BHK) cells, monkey kidney cells (COS), as well as others. Similarly, bacterial hosts 
such as E. coli, Bacillus subtilis, and Streptococcus spp., will find use with the present 

20 expression constructs. Yeast hosts useful in the present invention include inter alia, 
Saccharomyces cerevisiae, Candida albicans, Candida maltosa, Hansenula 
polymorpha, Kluyveromyces fragilis, Kluyveromyces lactis, Pichia guillerimondii, 
Pichia pastoris, Schizosaccharomyces pombe and Yarrowia lipolytica. Insect cells for 
use with baculovirus expression vectors include, inter alia, Aedes aegypti, Autographa 

25 californica, Bombyx mori, Drosophila melanogaster, Spodoptera frugiperda, and 

Trichoplusia ni. See, e.g., Summers and Smith, Texas Agricultural Experiment Station 
Bulletin No. 7555(1987). 

Viral vectors can be used for expression of polypeptides in eucaryotic cells, 
such as those derived from the pox family of viruses, including vaccinia virus and 

30 avian poxvirus. For example, a vaccinia based infection/transfection system, as 

described in Tomei et al., J. Virol. (1993) 67:4017-4026 and Selby et al., J. Gen. Virol. 
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(1993) 74:1 103-1 113, will also find use with the present invention. A vaccinia based 
infection/transfection system can be conveniently used to provide for inducible, 
transient expression of the coding sequences of interest in a host cell. In this system, 
cells are first infected in vitro with a vaccinia virus recombinant that encodes the 
5 bacteriophage T7 RNA polymerase. This polymerase displays exquisite specificity in 
that it only transcribes templates bearing T7 promoters. Following infection, cells are 
transfected with the polynucleotide of interest, driven by a T7 promoter. The 
polymerase expressed in the cytoplasm from the vaccinia virus recombinant 
transcribes the transfected DNA into RNA that is then translated into protein by the 

10 host translational machinery. The method provides for high level, transient, 

cytoplasmic production of large quantities of RNA and its translation products. See, 
e.g., Elroy-Stein and Moss, Proc. Natl. Acad. Sci. USA (1990) 87:6743-6747; Fuerst et 
al., Proc. Natl. Acad. Sci. USA (1986) 83:8122-8126. 

As an alternative approach to infection with vaccinia or avipox virus 

15 recombinants, an amplification system can be used that will lead to high level 

expression following introduction into host cells. Specifically, a T7 RNA polymerase 
promoter preceding the coding region for T7 RNA polymerase can be engineered. 
Translation of RNA derived from this template will generate T7 RNA polymerase 
which in turn will transcribe more template. Concomitantly, there will be a cDNA 

20 whose expression is under the control of the T7 promoter. Thus, some of the T7 RNA 
polymerase generated from translation of the amplification template RNA will lead to 
transcription of the desired gene. Because some T7 RNA polymerase is required to 
initiate the amplification, T7 RNA polymerase can be introduced into cells along with 
the template(s) to prime the transcription reaction. The polymerase can be introduced 

25 as a protein or on a plasmid encoding the RNA polymerase. For a further discussion 
of T7 systems and their use for transforming cells, see, e.g., PCT International 
Publication No. WO 94/2691 1; Studier and Moffatt, J. Mol. Biol. (1986) 189:113-130: 
Deng and Wolff, Gene (1994) 143:245-249; Gao et al., Biochem. Biophys. Res. 
Commun. (1994) 200:1201-1206; Gao and Huang, Nuc. Acids Res. (1993) 21:2867- 

30 2872; Chen et al., Nuc. Acids Res. (1994) 22:21 14-2120; and U.S. Patent No. 
5,135,855. 
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These vectors are transfected into an appropriate host cell. The cell lines are 
then cultured under appropriate conditions and the levels of any appropriate 
polypeptide product can be evaluated in supematants. For example, p24 can be used to 
evaluate Gag expression; gpl60, gpl40 or gpl20 can be used to evaluate Env 
5 expression; p6pol can be used to evaluate Pol expression; prot can be used to evaluate 
protease; pi 5 for RNAseH; p31 for Integrase; and other appropriate polypeptides for 
Vif, Vpr, Tat, Rev, Vpu and Nef. 

Further, modified polypeptides can also be used, for example, other Env 
polypeptides include, but are not limited to, for example, native gpl60, oligomeric 
10 gpl40, monomelic gpl20 as well as modified and/or synthetic sequences of these 
polypeptides. 

Western Blot analysis can be used to show that cells containing the synthetic 
expression cassette produce the expected protein, typically at higher per-cell 
concentrations than cells containing the native expression cassette. The HIV proteins 

15 can be seen in both cell lysates and supematants. 

Fractionation of the supematants from mammalian cells transfected with the 
synthetic expression cassette can be used to show that the cassettes provide superior 
production of HIV proteins and relative to the wild-type sequences. 

Efficient expression of these HrV-containing polypeptides in mammalian cell 

20 lines provides the following benefits: the polypeptides are free of baculovirus 

contaminants; production by established methods approved by the FDA; increased 
purity; greater yields (relative to native coding sequences); and a novel method of 
producing the Sub HIV-containing polypeptides in CHO cells which is not feasible in 
the absence of the increased expression obtained using the constructs of the present 

25 invention. Exemplary Mammalian cell lines include, but are not limited to, BHK, 
VERO, HT1080, 293, 293T, RD, COS-7, CHO, Jurkat, HUT, SUPT, C8166, 
MOLT4/clone8, MT-2, MT-4, H9, PM1, CEM, and CEMX174 (such cell lines are 
available, for example, from the A.T.C.C.). 

The desired polypeptide encoding sequences can be cloned into any number of 

30 commercially available vectors to generate expression of the polypeptide in an 

appropriate host system. These systems include, but are not limited to, the following: 
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baculovirus expression {Reilly, P.R., et al, Baculovirus Expression Vectors: A 
Laboratory Manual (1992); Beames, et al, Biotechniques 11:378 (1991); 
Pharmingen; Clontech, Palo Alto, CA)}, vaccinia expression {Earl, P. L., et al, 
"Expression of proteins in mammalian cells using vaccinia" In Current Protocols in 
5 Molecular Biology (F. M. Ausubel, et al. Eds.), Greene Publishing Associates & 

Wiley Interscience, New York (1991); Moss, B., et al, U.S. Patent Number 5,135,855, 
issued 4 August 1992}, expression in bacteria {Ausubel, F.M., et al, Current 
Protocols in Molecular Biology , John Wiley and Sons, Inc., Media PA; 
Clontech}, expression in yeast {Rosenberg, S. and Tekamp-Olson, P., U.S. Patent No. 

10 RE35,749, issued, March 17, 1998; Shuster, J.R., U.S. Patent No. 5,629,203, issued 
May 13, 1997; Gellissen, G., et al, Antonie Van Leeuwenhoek, 62(l-2):79-93 (1992); 
Romanos, M.A., et al, Yeast 8(6):423-488 (1992); Goeddel, D.V., Methods in 
Enzymology 185 (1990); Guthrie, C, and G.R. Fink, Methods in Enzymology 194 
(1991)}, expression in mammalian cells {Clontech; Gibco-BRL, Ground Island, NY; 

15 e.g., Chinese hamster ovary (CHO) cell lines (Haynes, J., et al, Nuc. Acid. Res. 
11:687-706 (1983); 1983, Lau, Y.F., etal.,Mol. Cell. Biol. 4:1469-1475 (1984); 
Kaufman, R. J., "Selection and coamplification of heterologous genes in mammalian 
cells," in Methods in Enzymology, vol. 185, pp537-566. Academic Press, Inc., San 
Diego CA (1991)}, and expression in plant cells {plant cloning vectors, Clontech 

20 Laboratories, Inc., Palo Alto, CA, and Pharmacia LKB Biotechnology, Inc., 

Pistcataway, NJ; Hood, E., et al, J. Bacteriol. 168:1291-1301 (1986); Nagel, R., et al, 
FEMS Microbiol. Lett. 67:325 (1990); An, et al, "Binary Vectors", and others in 
Plant Molecular Biology Manual A3:l-19 (1988); Miki, B.L.A., et al, pp.249-265, 
and others in Plant DNA Infectious Agents (Hohn, T., et al, eds.) Springer- Verlag, 

25 Wien, Austria, (1987); Plant Molecular Biology: Essential Techniques, P.G. Jones 
and J.M. Sutton, New York, J. Wiley, 1997; Miglani, Gurbachan Dictionary of Plant 
Genetics and Molecular Biology, New York, Food Products Press, 1998; Henry, R. J., 
Practical Applications of Plant Molecular Biology, New York, Chapman & Hall, 
1997}. 

30 In addition to the mammalian, insect, and yeast vectors, the synthetic 

expression cassettes of the present invention can be incorporated into a variety of 
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expression vectors using selected expression control elements. Appropriate vectors 
and control elements for any given cell can be selected by one having ordinary skill in 
the art in view of the teachings of the present specification and information known in 
the art about expression vectors. 
5 For example, a synthetic coding sequence can be inserted into a vector that 

includes control elements operably linked to the desired coding sequence, which allow 
for the expression of the coding sequence in a selected cell-type. For example, typical 
promoters for mammalian cell expression include the SV40 early promoter, a CMV 
promoter such as the CMV immediate early promoter (a CMV promoter can include 

10 intron A), RSV, HIV-Ltr, the mouse mammary tumor virus LTR promoter (MMLV- 
ltr), the adenovirus major late promoter (Ad MLP), and the herpes simplex virus 
promoter, among others. Other nonviral promoters, such as a promoter derived from 
the murine metallothionein gene, will also find use for mammalian expression. 
Typically, transcription termination and polyadenylation sequences will also be 

15 present, located 3' to the translation stop codon. Preferably, a sequence for 

optimization of initiation of translation, located 5' to the coding sequence, is also 
present. Examples of transcription terminator/polyadenylation signals include those 
derived from SV40, as described in Sambrook, et al., supra, as well as a bovine 
growth hormone terminator sequence. Introns, containing splice donor and acceptor 

20 sites, may also be designed into the constructs for use with the present invention 
(Chapman et al., Nuc. Acids Res. (1991) 19:3979-3986). 

Enhancer elements may also be used herein to increase expression levels of the 
mammalian constructs. Examples include the SV40 early gene enhancer, as described 
in Dijkema et al., EMBO J. (1985) 4:761, the enhancer/promoter derived from the long 

25 terminal repeat (LTR) of the Rous Sarcoma Virus, as described in Gorman et al., Proc. 
Natl. Acad. Sci. USA (1982b) 79:6777 and elements derived from human CMV, as , 
described in Boshart et al., Cell (1985) 41:521, such as elements included in the CMV 
intron A sequence (Chapman et al., Nuc. Acids Res. (1991) 19:3979-3986). 

Also included in the invention are expression cassettes, comprising coding 

30 sequences and expression control elements that allow expression of the coding regions 
in a suitable host. The control elements generally include a promoter, translation 
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initiation codon, and translation and transcription termination sequences, and an 
insertion site for introducing the insert into the vector. Translational control elements 
useful in expression of the polypeptides of the present invention have been reviewed 
by M. Kozak (e.g., Kozak, M., Mamm. Genome 7(8):563-574, 1996; Kozak, M., 
5 Biochimie 76(9):815-821, 1994; Kozak, M, J Cell Biol 108(2):229-241, 1989; Kozak, 
M., and Shatkin, A. J., Methods Enzymol 60:360-375, 1979). 

Expression in yeast systems has the advantage of commercial production. 
Recombinant protein production by vaccinia and CHO cell lines have the advantage of 
being mammalian expression systems. Further, vaccinia virus expression has several 

10 advantages including the following: (i) its wide host range; (ii) faithful post- 

transcriptional modification, processing, folding, transport, secretion, and assembly of 
recombinant proteins; (iii) high level expression of relatively soluble recombinant 
proteins; and (iv) a large capacity to accommodate foreign DNA. 

The recombinantly expressed polypeptides from immunogenic HIV 

15 polypeptide-encoding expression cassettes are typically isolated from lysed cells or 
culture media. Purification can be carried out by methods known in the art including 
salt fractionation, ion exchange chromatography, gel filtration, size-exclusion 
chromatography, size-fractionation, and affinity chromatography. Immunoaffinity 
chromatography can be employed using antibodies generated based on, for example, 

20 HIV antigens. Isolation of oligomeric forms of HIV envelope protein has been 
previously described (see, e.g., PCT International Application No. WO/00/39302). 

Advantages of expressing the proteins of the present invention using 
mammalian cells include, but are not limited to, the following: well-established 
protocols for scale-up production; cell lines are suitable to meet good manufacturing 

25 process (GMP) standards; culture conditions for mammalian cells are known in the art. 



2.3.5 immunogenicity enhancing components for use with the 
Polypeptide Component of the Present Invention 
• Compositions of the present invention for generating an immune response in a 
30 mammal, for example, comprising a polynucleotide component and a polypeptide 
component, can include various excipients, adjuvants, carriers, auxiliary substances, 
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modulating agents, and the like. The polypeptide component of the compositions of 
the present invention include an amount of the polypeptide sufficient to mount an 
immunological response. An appropriate effective amount can be determined by one 
of skill in the art. 

5 The polypeptide component may comprise a carrier wherein the carrier is a 

molecule that does not itself induce the production of antibodies harmful to the 
individual receiving the composition. Suitable carriers are typically large, slowly 
metabolized macromolecules such as proteins, polysaccharides, polylactic acids, 
polyglycollic acids, polymeric amino acids, amino acid copolymers, lipid aggregates 

10 (such as oil droplets or liposomes), and inactive virus particles. Examples of 

particulate carriers include those derived from polymethyl methacrylate polymers, as 
well as microparticles derived from poly(lactides) and poly(lactide-co-glycolides), 
known as PLC See, e.g., Jeffery et al., Pharm. Res. (1993) 10:362-368; McGee JP, et 
al., J Microencapsul. 14(2): 197-210, 1997; O'Hagan DT, et al., Vaccine ll(2):149-54, 

15 1993. Such carriers are well known to those of ordinary skill in the art. Additionally, 
these carriers may function as immunostimulating agents ("adjuvants"). Furthermore, 
the antigen may be conjugated to a bacterial toxoid, such as toxoid from diphtheria, 
tetanus, cholera, etc., as well as toxins derived from E. coli. 

Adjuvants may also be used to enhance the effectiveness of the compositions. 

20 Such adjuvants include, but are not limited to: (1) aluminum salts (alum), such as 
aluminum hydroxide, aluminum phosphate, aluminum sulfate, etc.; (2) oil-in- water 
emulsion formulations (with or without other specific immunostimulating agents such 
as muramyl peptides (see below) or bacterial cell wall components), such as for 
example (a) MF59 (PCT International Publication No. WO 90/14837), containing 5% 

25 Squalene, 0.5% Tween 80, and 0.5% Span 85 (optionally containing various amounts 
of MTP-PE (see below), although not required) formulated into submicron particles 
using a micro fluidizer such as Model HOY micro fluidizer (Micro fluidics, Newton, 
MA), (b) SAF, containing 10% Squalane, 0.4% Tween 80, 5% pluronic-blocked 
polymer L121, and thr-MDP (see below) either micro fluidized into a submicron 

30 emulsion or vortexed to generate a larger particle size emulsion, and (c) Ribi™ 

adjuvant system (RAS), (Ribi Immunochem, Hamilton, MT) containing 2% Squalene, 
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0.2% Tween 80, and one or more bacterial cell wall components from the group 
consisting of monophosphorylipid A (MPL), trehalose dimycolate (TDM), and cell 
wall skeleton (CWS), preferably MPL + CWS (Detox™); (3) saponin adjuvants, such 
as Stimulon™ (Cambridge Bioscience, Worcester, MA) may be used or particle 
5 generated therefrom such as ISCOMs (immunostimulating complexes); (4) Complete 
Freunds Adjuvant (CFA) and Incomplete Freunds Adjuvant (IF A); (5) cytokines, such 
as interleukins (IL-1, IL-2, etc.), macrophage colony stimulating factor (M-CSF), 
tumor necrosis factor (TNF), etc.; (6) oligonucleotides or polymeric molecules 
encoding immunostimulatory CpG motifs (Davis, H.L., et al., J. Immunology 

10 160:870-876, 1998; Sato, Y. et al., Science 273:352-354, 1996) or complexes of 
antigens/oligonucleotides {Polymeric molecules include double and single stranded 
RNA and DNA, and backbone modifications thereof, for example, methylphosphonate 
linkages; or (7) detoxified mutants of a bacterial ADP-ribosylating toxin such as a 
cholera toxin (CT), a pertussis toxin (PT), or an E. coli heat-labile toxin (LT), 

1 5 particularly LT-K63 (where lysine is substituted for the wild-type amino acid at 
position 63) LT-R72 (where arginine is substituted for the wild-type amino acid at 
position 72), CT-S109 (where serine is substituted for the wild-type amino acid at 
position 109), and PT-K9/G129 (where lysine is substituted for the wild- type amino 
acid at position 9 and glycine substituted at position 129) (see, e.g., PCT International 

20 Publication Nos. WO/93/13202 and WO/92/19265); (8) Muramyl peptides include, 
but are not limited to, N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP), N- 
acteyl-normuramyl-L-alanyl-D-isogluatme (nor-MDP), N-acetylmuramyl-L-alanyl-D- 
isogluatminyl-L-alanine-2-(r-2'-dipalmitoyl-j«-glycero-3-huydroxyphosphoryloxy)- 
ethylamine (MTP-PE), etc.; (9) Iscomatrix (CSL Limited,Victoria, Australia; also, see, 

25 e.g., Morein B, Bengtsson KL, "Immunomodulation by iscoms, immune stimulating 
complexes," Methods. Sep;19(l):94-102, 1999) and (10) other substances that act as 
immunostimulating agents to enhance the effectiveness of the composition (e.g., Alum 
and CpG oligonucleotides). 

Preferred adjuvants include, but are not limited to, MF59 and Iscomatrix. 

30 Dosage treatment with the polypeptide component of the immune stimulating 

compositions of the present invention may be a single dose schedule or a multiple 
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dose schedule. A multiple dose schedule is one in which a primary course of 
vaccination may be with 1-10 separate doses, followed by other doses given at 
subsequent time intervals, chosen to maintain and/or reinforce the immune response, 
for example at 1-4 months for a second dose, and if needed, a subsequent dose(s) after 
5 several months. The dosage regimen will also, at least in part, be determined by the 
need of the subject and be dependent on the judgment of the practitioner. 

Direct delivery of the polypeptide component of the immune-response 
generating compositions of the present invention is generally accomplished, with or 
without adjuvants, by injection using either a conventional syringe or a gene gun, such 

10 as the Accell® gene delivery system (PowderJect Technologies, Inc., Oxford, 
England). The polypeptides can be injected either subcutaneously, epidermally, 
intradermally, intramuco sally such as nasally, rectally and vaginally, intraperitoneally, 
intravenously, orally or intramuscularly. Other modes of administration include oral 
and pulmonary administration, suppositories, and needle-less injection. Dosage 

1 5 treatment may be a single dose schedule or a multiple dose schedule. Administration 
of polypeptides may also be combined with administration of adjuvants or other 
substances. 

2.3.6 Immunomodulatory Molecules 

20 In some embodiments of the present invention, gene transfer vectors can be 

constructed to encode a cytokine or other immunomodulatory molecule. For example, 
nucleic acid sequences encoding native IL-2 and gamma-interferon can be obtained as 
described in US Patent Nos. 4,738,927 and 5,326,859, respectively, while useful 
muteins of these proteins can be obtained as described in U.S. Patent No. 4,853,332. 

25 Nucleic acid sequences encoding the short and long forms of mCSF can be obtained as 
described in US Patent Nos. 4,847,201 and 4,879,227, respectively. In particular 
aspects of the invention, retroviral vectors expressing cytokine or immunomodulatory 
genes can be produced (e.g., PCT International Publication No. WO/94/02951, entitled 
"Compositions and Methods for Cancer Immunotherapy"). 

30 Examples of suitable immunomodulatory molecules for use herein include the 

following: IL-1 and IL-2 (Karupiah et al. (1990) J. Immunology 144:290-298, Weber 
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et al. (1987) J. Exp. Med. 166:1716-1733, Gansbacher et al. (1990) J. Exp. Med. 
172:1217-1224, and U.S. Patent No. 4,738,927); IL-3 and IL-4 (Tepper et al. (1989) 
Cell 57:503-512, Golumbek et al. (1991) Science 254:713-716, and U.S. Patent No' 
5,017,691); IL-5 and IL-6 (Brakenhof et al. (1987) J. Immunol. 139:41 16-4121, and 
5 PCT International Publication No. WO 90/06370); IL-7 (U.S. Patent No. 4,965,195); 
IL-8, IL-9, IL-10, IL-11, IL-12, and IL-13 {Cytokine Bulletin, Summer 1994); IL-14 
and IL-15; alpha interferon (Finter et al. (1991) Drugs 42:749-765, U.S. Patent Nos. 
4,892,743 and 4,966,843, PCT International Publication No. WO 85/02862, Nagata et 
al. (1980) Nature 284:316-320, Familletti et al. (1981) Methods in Enz. 78:387-394, 

10 Twu et al. (1989) Proc. Natl. Acad. Sci. USA 86:2046-2050, and Faktor et al. (1990) 
Oncogene 5:867-872); beta-interferon (Seif et al. (1991) J. Virol. 65:664-671); 
gamma-interferons (Radford et al. ( 1 99 1 ) The American Society of Hepatology 
20082015, Watanabe et al. (1989) Proc. Natl. Acad. Sci. USA 86:9456-9460, 
Gansbacher et al. (1990) Cancer Research 50:7820-7825, Maio et al. (1989) Can. 

15 Immunol. Immunother. 30:34-42, and U.S. Patent Nos. 4,762,791 and 4,727,138); G- 
CSF (U.S. Patent Nos. 4,999,291 and 4,810,643); GM-CSF (PCT International 
Publication No. WO 85/04188). 

Immunomodulatory factors may also be agonists, antagonists, or ligands for 
these molecules. For example, soluble forms of receptors can often behave as 

20 antagonists for these types of factors, as can mutated forms of the factors themselves. 

Nucleic acid molecules that encode the above-described substances, as well as 
other nucleic acid molecules that are advantageous for use within the present 
invention, may be readily obtained from a variety of sources, including, for example, 
depositories such as the American Type Culture Collection, or from commercial 

25 sources such as British Bio-Technology Limited (Cowley, Oxford England). 

Representative examples include BBG 12 (containing the GM-CSF gene coding for 
the mature protein of 127 amino acids), BBG 6 (which contains sequences encoding 
gamma interferon), A.T.C.C. Deposit No. 39656 (which contains sequences encoding 
TNF), A.T.C.C. Deposit No. 20663 (which contains sequences encoding alpha- 

30 interferon), A.T.C.C. Deposit Nos. 31902, 31902 and 395 17 (which contain sequences 
encoding beta-interferon), A.T.C.C. Deposit No. 67024 (which contains a sequence 
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which encodes Interleukin-lb), A.T.C.C. Deposit Nos. 39405, 39452, 39516, 39626 
and 39673 (which contain sequences encoding Interleukin-2), A.T.C.C. Deposit Nos. 
59399, 59398, and 67326 (which contain sequences encoding Interleukin-3), A.T.C.C. 
Deposit No. 57592 (which contains sequences encoding Interleukin-4), A.T.C.C. 
5 Deposit Nos. 59394 and 59395 (which contain sequences encoding Interleukin-5), and 
A.T.C.C. Deposit No. 67153 (which contains sequences encoding Interleukin-6). 

Plasmids containing cytokine genes or immunomodulatory genes (PCT 
International Publication Nos. WO 94/02951 and WO 96/21015) can be digested with 
appropriate restriction enzymes, and DNA fragments containing the particular gene of 

10 interest can be inserted into a gene transfer vector using standard molecular biology 
techniques. (See, e.g., Sambrook et al., supra., or Ausubel et al. (eds) Current 
Protocols in Molecular Biology, Greene Publishing and Wiley-Interscience). 

Polynucleotide sequences coding for the above-described molecules can be 
obtained using recombinant methods, such as by screening cDNA and genomic 

15 libraries from cells expressing the gene, or by deriving the gene from a vector known 
to include the same. For example, plasmids that contain sequences that encode altered 
cellular products may be obtained from a depository such as the A.T.C.C, or from 
commercial sources. Plasmids containing the nucleotide sequences of interest can be 
digested with appropriate restriction enzymes, and DNA fragments containing the 

20 nucleotide sequences can be inserted into a gene transfer vector using standard 
molecular biology techniques. 

Alternatively, cDNA sequences for use with the present invention may be 
obtained from cells that express or contain the sequences, using standard techniques, 
such as phenol extraction and PCR of cDNA or genomic DNA. See, e.g., Sambrook 

25 et al., supra, for a description of techniques used to obtain and isolate DNA. Briefly, 
mRNA from a cell which expresses the gene of interest can be reverse transcribed 
with reverse transcriptase using oligo-dT or random primers. The single stranded 
cDNA may then be amplified by PCR (see U.S. Patent Nos. 4,683,202, 4,683,195 and 
4,800,159, see also PCR Technology: Principles and Applications for DNA 

30 Amplification, Erlich (ed.), Stockton Press, 1989)) using oligonucleotide primers 
complementary to sequences on either side of desired sequences. 
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The nucleotide sequence of interest can also be produced synthetically, rather 
than cloned, using a DNA synthesizer (e.g., an Applied Biosystems Model 392 DNA 
Synthesizer, available from ABI, Foster City, California). The nucleotide sequence 
can be designed with the appropriate codons for the expression product desired. The 
5 complete sequence is assembled from overlapping oligonucleotides prepared by 
standard methods and assembled into a complete coding sequence. See, e.g., Edge 
(1981) Nature 292:756; Nambair et al. (1984) Science 223:1299; Jay et al. (1984) J. 
Biol. Chem. 259:6311. 

10 2.4.0 Generation of Immune Response In Treated Subjects 

To evaluate efficacy, nucleic acid immunization using the polynucleotide 
component of the present invention (e.g., two expression cassettes each comprising a 
coding sequence for gpl40, wherein each coding sequence is derived from different 
HIV subtypes, serotypes, or strains) and antigenic immunization using the polypeptide 

15 component of the present invention (e.g., an oligomeric gpl40 wherein the coding 
sequence is derived from one of the HIV subtypes, serotypes, or strains represented in 
the polynucleotide component) can be performed, for example, as follows. 

Example 2 describes methods for the evaluation, in mice, of the 
immunogenicity of the compositions of the present invention used to induce immune 

20 response. The polynucleotide component described comprises two pCMVKM2 each 
carrying codon optimized coding sequences for gpl40 with delV2, the first coding 
sequence derived from SF162, subtype B, and the second coding sequence derived 
from TV1, subtype C. The mice are then immunized with oligomeric, codon 
optimized, gpl40 with delV2, derived from SF162, subtype B, polypeptide. Humoral 

25 and cellular immune responses are evaluated. The results of these assays are used to 
show the potency of the polynucleotide/polypeptide immunization methods of the 
present invention for the generation of an immune response in mice. 

Example 3 describes in vivo immunization studies that may be carried out in a 
variety of laboratory animals (including, mice, guinea pigs, rabbits, rhesus macaques, 

30 and baboons). Results of these studies are used to demonstrate the usefulness of the 
compositions and methods of the invention to generate immune responses, in 
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particular to generate broad and potent neutralizing activity against diverse HIV 
strains. 

Example 4 describes experiments performed in support of the present 
invention that evaluated immunogenicity regimens for various HIV polypeptide 
5 encoding plasmids used as primes and various HIV polypeptides used as boosts. In 
the example, the following vectors encoding gpl40 proteins were employed: pCMV 
gpl40 dV2 SF162 DNA and pCMV gpl40 dV2 TV1 DNA. These vectors comprise 
expression cassettes that encode gpl40 protein derived from two different HIV 
subtypes, subtype B (SF162) and subtype C (TV1). The V2 loop was deleted in both 

10 constructs and the coding sequences were codon optimized for expression in human 
cells. The specific gpl40 polynucleotides have been previously described (e.g., 
gpl40.modSF162.delV2, Figure 6, and gpl40.mut7.modSF162.delV2, Figure 7, see 
also, PCT International Publication No. WO/00/39302; and gpl40mod.TVl.delV2, 
Figure 8, and gpl40mod.TVl.mut7.delV2, Figure 9, see also PCT International 

15 Publication No. WO/02/04493). The ability of the compositions and methods of the 
present invention to generate neutralizing antibodies was evaluated. The results of the 
assays for the presence of neutralizing antibodies are presented in Figure 4 and Figure 
5. 

Figure 4 summarizes data showing the neutralization titers against HIV-1 
20 SF162 between seven experimental groups. These results demonstrated that all groups 
showed strong neutralizing activity against the HIV-1 SF162 isolate. Further, 
neutralizing activity significantly increased at post 4 th immunization compared to post 
3 rd immunizations. For the mixed (B+C) DNA prime and single protein boost, B 
protein gave a high boost to the mixed gene prime (B+C DNA + B prot), as did the C 
25 protein (B+C DNA + C prot). For the mixed DNA prime and protein boost, half dose 
(50ug) of protein (B+C DNA & prot (1/2)) induced neutralizing activity at least as 
well as the full dose of lOOug protein (B+C DNA & prot). 

Figure 5 summarizes data showing the neutralization titers against HrV-1 TV1 
(South African subtype C) between seven experimental groups. These results 
30 demonstrated that all groups showed neutralizing activity against HIV1 subtype C 

TV1 isolate (as expected, because no subtype C DNA or protein was used, the B DNA 
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+ B prot showed the lowest neutralizing activity). For the mismatched a single DNA 
prime and a single protein boost (C DNA + B prot), priming with C gene and boosting 
with B protein showed high titers, as did the B gene and B protein (B DNA + B prot). 
For the mixed (B+C) DNA prime and single protein boost, use of either B (B+C DNA 
5 + B prot) and C (B+C DNA + C prot) proteins had a similar boosting effect. 

Comparison of the data presented in Figure 4 and Figure 5 supports the 
combination methods of the present invention for generating an immune response in a 
subject, further, for generating neutralizing antibodies in immunized subjects. The 
data showed that the combination of DNA derived from different subtypes primed 

10 broad responses to multiple subtypes. This could be the result of the combined 
responses to subtype and/or sequence-specific continuous and/or discontinuous 
immunogenic epitopes as well as responses to the presentation of common conserved 
eptiopes in the oligomeric V2-deleted Env immunogens employed here. Furthermore, 
use of a single subtype protein was sufficient to boost broad neutralizing responses 

15 when immunity was primed with multiple subtypes of DNA. These results also 
demonstrated that use of lower doses of proteins mixture can also provide strong 
immune responses. 

Example 5 presents data demonstrating that a subject (in this example 
chimpanzees) can be immunized with an envelope protein from a first HIV strain of a 

20 given subtype (e.g., HIV-1 MN), be boosted with an envelope protein from a second 
HIV strain of the same subtype (e.g., HIV-1 SF162) and generate neutralizing 
antibodies against both HIV strains (see, for example Table 1 1 , Example 5). The data 
in Example 5 supports that the combination methods of the present invention can be 
used to broadly raise neutralizing antibodies against multiple viral strains from the 

25 same subtype. Further, the data presented in Example 4 in combination with the data 
presented in Example 5 together demonstrate that such HIV strains may be within 
subtype, or from different subtypes. 

These studies demonstrated the usefulness of the compositions (e.g., 
comprising a polynucleotide component and a polypeptide component) and methods 

30 of the invention to generate immune responses, in particular to generate broad and 
potent neutralizing activity against diverse HIV subtypes and strains. It is readily 
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apparent that the subject invention can be used to mount an immune response to a 
wide variety of antigens and hence to treat or prevent infection, particularly HIV 
infection. 

5 3.0.0 Applications of the Present Invention to HIV 

While not desiring to be bound by any particular model, theory, or hypothesis, 
the following information is presented to provide a more complete understanding of 
the present invention. 

Protection against HIV infection will likely require potent and broadly reactive 

10 pre-existing neutralizing antibodies in vaccinated individuals exposed to a virus 
challenge. Although cellular immune responses are desirable to control viremia in 
those who get infected, protection against infection has not been demonstrated for 
vaccine approaches that rely exclusively on the induction of these responses. For this 
reason, experiments performed in support of the present invention used combination 

1 5 prime-boost approaches that employ a polynucleotide component and a polypeptide 
component, wherein the polypeptide component encodes, for example, V-deleted 
envelope antigens from primary HIV isolates (e.g., R5 subtype B (HIV-1sfi62) and 
subtype C (HIV-Itvi) strains), and the polypeptide component comprises at least one 
of these antigens. 

20 The polynucleotide component of the present invention may be delivered by 

enhanced DNA or RNA [polylactide co-glycolide (PLG) microparticle formulations or 
electroporation], adenovirus-based vectors, alphavirus replicons or replicon particles, 
polynucleotide or particle-based vaccine approaches. Efficient in vivo expression of 
plasmid encoded genes by electrical permeabilization has been described (see, e.g., 

25 Zucchelli et al. (2000) J. Virol. 74: 1 1598-1 1607; Banga et al. (1998) Trends 

Biotechnol. 10:408-412; Heller et al. (1996) Febs Lett. 389:225-228; Mathiesen et al. 
(1999) Gene Ther. 4:508-514; Miretal. (1999) Proc. Nat 'I Acad Sci. USA 8:4262- 
4267; Nishi et al. (1996) Cancer Res. 5:1050-1055). The polypeptide component of 
the present invention may be administered, for example, by booster immunizations 

30 with Env proteins in MF59 or Iscomatrix adjuvant. 
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All protein preparations were highly purified and extensively characterized by 
biophysical and immunochemical methodologies. Results from rabbit 
immunogenicity studies indicated that broad neutralizing antibody responses could be 
consistently induced against diverse HIV strains (Example 4). Moreover, using the 
5 combination prime-boost vaccine regimens, potent HIV antigen-specific CD4 + and 
CD8+ T-cell responses may also be generated. 

Although any HIV viral protein may also be employed in the practice of the 
present invention, in a preferred embodiment VI -, V2-, and/or V3 -modified/deleted 
envelope DNA and corresponding polypeptides are good candidates for use in the 
1 0 compositions of the present invention. 

One embodiment of this aspect of the present invention may be described 
generally as follows. Antigens are selected for the vaccine composition(s). 
Polynucleotides encoding Env polypeptides and Env polypeptides are typically 
employed in a composition for generating an immunse response comprising a 
15 polynucleotide component and a polypeptide component. 

A nucleic acid prime is typically followed by at least one polypeptide boost. 
The boost may, for example, include adjuvanted HlV-derived polypeptides (e.g., 
analogous to those used for the DNA vaccinations), coding sequences for HlV-derived 
polypeptides (e.g., analogous to those used for the DNA vaccinations) encoded by a 
20 viral vector. Boosts may include further DNA vaccinations, and/or combinations of 
the foregoing. 

Further, different polypeptide antigens may be used in the boost relative to the 
initial vaccination and visa versa. In addition, the initial nucleic acid vaccination may 
be a viral vector comprising a DNA expression cassette construct. 

25 Some factors that may be considered in HIV envelope vaccine design are as 

follows. A fundamental criterion of an effective HIV vaccine is its ability to induce 
broad and potent neutralizing antibody responses against prevalent HIV strains. The 
important contribution of neutralizing antibodies in preventing the establishment of 
HIV, SIV and SHIV infection or delaying the onset of disease is highlighted by 

30 several studies. First, the emergence of neutralization-resistant viruses coincides or 
precedes the development of disease in infected animals (Burns (1993) / Virol. 
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67:4104-13; Cheng-Mayer et al. (1999) J. Virol. 73:5294-5300; Narayan et al. (1999) 
Virology 256:54-63). Second, the pre-infusion of high concentrations of potent 
neutralizing monoclonal antibodies (mAbs) in the blood circulation of macaques, 
chimpanzees and SCID mice prior to their challenge with HIV, SIV or SHIV viruses, 
5 offers protection or delays the onset of disease (Conley et al. (1996) J. Virol. 70:675 1- 
6758; Emini et al. (1992) Nature (London) 355:728-730; Gauduin et al. (1997) Nat 
Med. 3:1389-93; Mascola et al. (1999) J Virol. 73:4009-18; Mascola et al. (2000) 
Nature Med. 6(2):207-210; Baba et al. (2000) Nature Med. 6(2):200-206). Similarly, 
infusion of neutralizing antibodies collected from the serum of HIV- 1 -infected 

10 chimpanzees to naive pig-tailed macaques protects the latter animals from subsequent 
viral challenge by SHIV viruses (Shibata et al (1999) Nature Medicine 5:204-210). 
Moreover, envelope-based vaccines have demonstrated protection against infection in 
non-human primate models. Vaccines that exclude Env-polypeptides generally confer 
less protective efficacy (see, e.g., Hu, S.L., et al., Recombinant subunit vaccines as an 

1 5 approach to study correlates of protection against primate lentivirus infection, 

Immunol Lett. Jun;5 1(1-2): 1 15-9 (1996); Amara, R.R., et al., Critical role for Env as 
well as Gag-Pol in control of a simian-human immunodeficiency virus 89.6P 
challenge by a DNA prime/recombinant modified vaccinia virus Ankara vaccine, J 
Virol. Jun;76(12):6138-46 (2002)). 

20 Monomelic gpl20 protein-derived from the SF2 lab strain provided 

neutralization of HIV- 1 lab strains and protection against virus challenges in primate 
models (Verschoor, E.J., et al., (1999), "Comparison of immunity generated by 
nucleic acid, MF59 and ISCOM-formulated HIV-1 gpl20 vaccines in rhesus 
macaques," J. Virology 73: 3292-3300). Primary gp 120 protein derived from Thai E 

25 field strains provided cross-subtype neutralization of lab strains (VanCott, T.C., et al., 
(1999) "Cross-subtype neutralizing antibodies induced in baboons by a subtype E 
gpl20 immunogen based on an R5 primary human immunodeficiency virus type 1 
envelope," J. Virology 73: 4640-4650). Primary sub-type B oligomeric o-gpl40 
protein provided partial neutralization of subtype B primary (field) isolates (Barnett, 

30 S.W., et al. (2001) "The ability of an oligomeric HIV-1 envelope antigen to elicit 
neutralizing antibodies against primary HIV-1 isolates is improved following the 
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partial deletion of the second hypervariable region," J. Virology, 75:5526-5540). 

Primary sub-type B o-gpl40 delV2 DNA prime plus protein boost provided potent 

neutralization of diverse subtype B primary isolates and protection against virus 

challenge in primate models (Cherpelis, S., et al., (2000) "Vaccine-induced anti- 
5 envelope antibodies offer partial protection from SHIV infection to CD8+T-cell 

depleted rhesus macaques," J. Virology, 75, 1547-1550). 

Vaccine strategies for induction of potent, broadly reactive, neutralizing 

antibodies may be assisted by construction of Envelope polypeptide structures that 

expose conserved neutralizing epitopes, for example, variable-region 
10 modifications/deletions and de-glycosylations, envelope protein-receptor complexes, 

rational design based on crystal structure (e.g., beta-sheet deletions), and gp41-fusion 

domain based immunogens. 

Stable CHO cell lines for envelope protein production have been developed 

using optimized envelope polypeptide coding sequences, including, but not limited to, 
15 the following: gpl20, o-gpl40, gpl20delV2, o-gpl40delV2, gpl20delVlV2, o- 

gpl40delVlV2. 

Exemplary antigenic compositions and immunogenicity studies in support of 
the compositions and methods of the present invention are presented in Example 4. 
In a first particular aspect of the present invention for compositions for 

20 generating an immune response in a mammal, the polynucleotide component of the 
present invention consists essentially of one polynucleotide encoding an HIV 
immunogenic polypeptide, and the polypeptide component comprises of one or more 
HIV immunogenic polypeptides analogous to the polypeptide encoded by said 
polynucleotide component, with the proviso that at least one HIV immunogenic 

25 polypeptide of the polypeptide component is derived from a different HIV subtype, 
serotype, or strain than the coding sequence of the immunogenic polypeptide encoded 
by the polynucleotide component. In this context, the polynucleotide component 
consisting essentially of one polynucleotide encoding an HIV immunogenic 
polypeptide refers to the presence of one polynucleotide encoding one HIV 

30 immunogenic polypeptide in the composition. The polynucleotide composition may 
comprise further components in addition to the HIV immnunogenic polypeptide, such 



86 



PP21 100.001 
PATENT 

as immune enhancers, immunoregulatory components, vector sequences (e.g., viral or 
non-viral), carriers, particles, excipients, expression control sequences, etc. In one 
embodiment of this aspect of the present invention, the HIV immunogenic 
polypeptide encoded by the polynucleotide component is derived from a subtype B 
5 strain, and at least one coding sequence of an HIV immunogenic polypeptide of the 
polypeptide component is derived from a subtype C strain. 

In one embodiment a composition for generating an immune response in a 
mammal comprises, a polynucleotide component consisting essentially of a 
polynucleotide encoding an HIV immunogenic polypeptide derived from a first HIV 

10 strain of a first subtype, and a polypeptide component comprising one or more HIV 
immunogenic polypeptides analogous to the polypeptide encoded by the 
polynucleotide component, provided that at least one HIV immunogenic polypeptide 
of the polypeptide component is derived from a second HIV strain of the first subtype, 
wherein the first and second strain are different. In some embodiments of this aspect 

15 the polynucleotide component does not encode an analogous HIV immunogenic 

polypeptide derived from any subtype other than the first subtype, and the polypeptide 
component does not comprise an analogous HIV immunogenic polypeptide derived 
from any subtype other than the first subtype. 

In a second particular aspect of the present invention for compositions for 

20 generating an immune response in a mammal, the polynucleotide component 

comprises two or more polynucleotide sequences comprising coding sequences for 
two or more analogous HIV immunogenic polypeptides, wherein the coding sequences 
for at least two of the HIV immunogenic polypeptides are derived from different HIV 
subtypes, serotypes, or strains, and the polypeptide component comprises of one or 

25 more HIV immunogenic polypeptides analogous to the polypeptide encoded by said 
polynucleotide component, with the proviso that (i) if the polypeptide component 
provides less than the number of analogous HIV immunogenic polypeptides encoded 
by the polynucleotide component, then the HIV immunogenic polypeptides of the 
polypeptide composition may be derived from the same and/or different HIV 

30 subtypes, serotypes, or strains as the HIV immunogenic polypeptides provided by the 
polynucleotide component, or (ii) if the polypeptide component provides the same or 
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greater than the number of analogous HIV immunogenic polypeptides encoded by the 
polynucleotide component, then at least one of the HIV immunogenic polypeptides of 
the polypeptide composition is derived from a different HIV subtype, serotype, or 
strain than the HIV immunogenic polypeptides provided by the polynucleotide 
5 component. 

In one embodiment, the present invention includes a composition for use in 
generating an immune response in a subject, wherein the composition comprises a 
polynucleotide encoding an immunogenic HIV polypeptide and an analogous 
immunogenic HIV polypeptide from a different HIV subtype, serotype, or strain. The 

10 polynucleotide encoding an immunogenic HIV polypeptide is used for immunization 
via delivery of the polynucleotide (e.g., a prime), an analogous immunogenic HIV 
polypeptide derived from a different HIV subtype, serotype, or strain is used for 
immunization (e.g., a boost). For example, a DNA molecule is used for nucleic acid 
immunization, wherein the DNA molecule encodes an HIV envelope polypeptide (i) 

15 derived from an HIV subtype C isolate, and (ii) that is codon optimized for expression 
in mammalian cells. This DNA immunization is followed by a protein boost using an 
HIV envelope polypeptide derived from an HIV subtype B isolate. Exemplary 
envelope proteins include, but are not limited to, gpl20, gpl40, oligomeric gpl40, and 
gpl60, including mutated forms thereof (e.g., deletion of the V2 loop). One 

20 embodiment of this aspect of the present invention, comprises a composition for 
generating an immune response in a mammal, the composition comprising: a 
polynucleotide component having of a first polynucleotide encoding a first HIV 
immunogenic polypeptide; and a polypeptide component, comprising a second HIV 
immunogenic polypeptide, wherein the first and second immunogenic HIV 

25 polypeptide are derived from different HIV subtypes, serotypes, or strains, and (ii) the 
first and second immunogenic polypeptides encode analogous HIV polypeptides. 

A second embodiment the present invention includes a composition for use in 
generating an immune response in a subject, wherein the composition comprises a 
polynucleotide component comprising two or more polynucleotides encoding 

30 immunogenic HIV polypeptides, derived from at least two different subtypes, 
serotypes, or strains, and a polypeptide component having a single, analogous, 



88 



PP21 100.001 
PATENT 



immunogenic HTV polypeptides derived from one of the subtypes, serotypes, or strains 
that is used for boosting. For example, two DNA molecules are used for nucleic acid 
immunization, wherein the first DNA molecule encodes an HIV envelope polypeptide 
(i) derived from an HIV subtype C isolate, and (ii) that is codon optimized for 
5 expression in mammalian cells, and the second DNA molecule encodes an HIV 

envelope polypeptide (i) derived from an HIV subtype B isolate, and (ii) that is codon 
optimized for expression in mammalian cells. This DNA immunization is followed by 
a protein boost using a single HIV envelope polypeptide (i) derived from an HIV 
subtype B isolate or an HIV subtype C isolate. Exemplary envelope proteins include, 

10 but are not limited to, gpl20, gpl40, oligomeric gpl40, and gpl60, including mutated 
forms thereof (e.g., deletion of the V2 loop). One embodiment of this aspect of the 
present invention comprises a composition for generating an immune response in a 
mammal, the composition comprising: a polynucleotide component comprising a first 
polynucleotide encoding a first immunogenic HIV polypeptide, and a second 

15 polynucleotide encoding a second immunogenic HIV polypeptide, wherein (i) the first 
and second immunogenic HIV polypeptide are derived from different HIV subtypes, 
and (ii) the first and second immunogenic polypeptides encode analogous HIV 
polypeptides, and a polypeptide component, having the first HIV immunogenic 
polypeptide, or the second HIV immunogenic polypeptide, with the proviso that the 

20 polypeptide component comprises at least one less HIV immunogenic polypeptide 
than is encoded by the polynucleotide component. 

In another embodiment, a composition for generating an immune response in a 
mammal comprises a polynucleotide component comprising two or more 
polynucleotide sequences comprising coding sequences for two or more analogous 

25 HIV immunogenic polypeptides derived from a first HIV subtype, wherein the coding 
sequences for at least two of the HIV immunogenic polypeptides are derived from 
different HIV strains of the first subtype, and a polypeptide component that comprises 
one or more HIV immunogenic polypeptides analogous to the polypeptide encoded by 
the polynucleotide component, with the proviso that (i) if the polypeptide component 

30 comprises less than the number of analogous HIV immunogenic polypeptides encoded 
by the polynucleotide component, then the HTV immunogenic polypeptides of the 
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polypeptide composition may be derived from the same and/or different HIV strains of 
the first subtype, or (ii) if the polypeptide component comprises the same or greater 
than the number of analogous HIV immunogenic polypeptides encoded by the 
polynucleotide component, then at least one of the HIV immunogenic polypeptides of 
5 the polypeptide composition is derived from a different HIV strain of the first subtype; 
with the provisos that (i) the polynucleotide component does not encode an HIV 
immunogenic polypeptide derived from any subtype other than the first subtype, and 
(ii) the polypeptide component does not comprise an HIV immunogenic polypeptide 
derived from any subtype other than the first subtype. 

1 0 The polynucleotide component may comprise further components as described 

herein (e.g., carriers, vector sequences, control sequences, etc.). The polypeptide 
component may comprise further components as described herein (e.g., carriers, 
adjuvants, immunoenhancers, etc.). 

In a third aspect, the present invention relates to the use of varied doses of 

15 polynucleotides and polypeptides in immunization methods (e.g., prime/boost 
methods), particularly the methods described herein. Thus, another aspect of the 
invention provides a method of generating an immune response in a subject 
comprising administering a polynucleotide component consisting essentially of one 
polynucleotide encoding an HIV immunogenic polypeptide derived from a first HIV 

20 strain of a first subtype, to a subject under conditions that are compatible with the 
expression of said polynucleotide in said subject for the production of the encoded 
HIV immunogenic polypeptide; and, administering a polypeptide component 
comprising one or more HIV immunogenic polypeptides analogous to the polypeptide 
encoded by said polynucleotide component, with the proviso that at least one HIV 

25 immunogenic polypeptide of the polypeptide component is derived from a second 

strain of the first subtype, wherein said first HIV strain and said second HIV strain are 
different. Typically, the polynucleotide component does not encode an analogous 
HIV immunogenic polypeptide derived from any subtype other than the first subtype, 
and the polypeptide component does not comprise an analogous HIV immunogenic 

30 polypeptide derived from any subtype other than the first subtype. 



90 



PP2 1100.001 
PATENT 



Another aspect of the present invention provides a method of generating an 
immune response in a subject comprising administering a polynucleotide component 
comprising two or more polynucleotides comprising coding sequences for two or more 
analogous HIV immunogenic polypeptides derived from a first HIV subtype, wherein 
5 the coding sequences for at least two of the HIV immunogenic polypeptides are 
derived from different HIV strains of the first subtype, to a subject under conditions 
that are compatible with the expression of said polynucleotides in said subject for the 
production of the encoded HIV immunogenic polypeptides; and, administering a 
polypeptide component that comprises one or more HIV immunogenic polypeptides 

1 0 analogous to the polypeptide encoded by said polynucleotide component, with the 
proviso that if the polypeptide component comprises the same number or greater than 
the number of analogous HIV immunogenic polypeptides encoded by the 
polynucleotide component, then at least one of the HIV immunogenic polypeptides of 
the polypeptide composition is derived from a different HIV strain of the first subtype 

15 than the HIV immunogenic polypeptides provided by the polynucleotide component. 
Typically, the polynucleotide component does not encode an analogous HIV 
immunogenic polypeptide derived from any subtype other than the first subtype, and 
the polypeptide component does not comprise an analogous HIV immunogenic 
polypeptide derived from any subtype other than the first subtype. 

20 In a further aspect, the invention provides a method of generating an immune 

response in a subject comprising 

providing a composition comprising 

a polynucleotide component consisting essentially of one polynucleotide 
encoding an HIV immunogenic polypeptide derived from a first HIVstrain of a first 
25 subtype, and 

a polypeptide component comprising one or more HIV immunogenic 
polypeptides analogous to the polypeptide encoded by said polynucleotide component, 
with the proviso that at least one HIV immunogenic polypeptide of the polypeptide 
component is derived from a second HIV strain of the first subtype wherein said first 
30 and second strains are different; 
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administering a gene delivery vector comprising the polynucleotide of said 
polynucleotide component of the composition into said subject under conditions that 
are compatible with expression of said polynucleotide in said subject for the 
production of encoded HIV immunogenic polypeptides; and 
5 administering the polypeptide component to said subject. 

Yet another aspect of the invention provides a method of generating an 
immune response in a subject comprising 

providing a composition comprising a polynucleotide component comprising 
two or more polynucleotide sequences comprising coding sequences for two or more 

10 analogous HIV immunogenic polypeptides derived from a first HIV subtype, wherein 
the coding sequences for at least two of the HIV immunogenic polypeptides are 
derived from different HIV strains of the first subtype, and a polypeptide component 
comprising one or more HIV immunogenic polypeptides analogous to the polypeptide 
encoded by said polynucleotide component, with the proviso that if the polypeptide 

15 component comprises the same number or greater than the number of analogous HIV 
immunogenic polypeptides encoded by the polynucleotide component, then at least 
one of the HIV immunogenic polypeptides of the polypeptide composition is derived 
from a different HIV strain of the first subtype than the HIV immunogenic 
polypeptides provided by the polynucleotide component; 

20 administering one or more gene delivery vectors comprising the 

polynucleotides of said polynucleotide component of the composition into said subject 
under conditions that are compatible with expression of said polynucleotides in said 
subject for the production of encoded HIV immunogenic polypeptides; and 

administering the polypeptide component to said subject. Typically, the 

25 polynucleotide component of the composition does not encode an analogous HIV 
immunogenic polypeptide derived from any subtype other than the first subtype, and 
the polypeptide component of the composition does not comprise an analogous HIV 
immunogenic polypeptide derived from any subtype other than the first subtype. 

In any immunization method using, for example, a mixed polynucleotide prime 

30 (i.e., two or more polynucleotides encoding immunogenic HIV polypeptides derived 
from two or more HIV subtypes, serotypes, or strains) in conjunction with a 
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polypeptide boost the present invention includes using reduced doses of each single 
component to provide an equivalent immune response to using full doses of each 
component. In one embodiment, the high threshold of DNA is the maximum tolerable 
dose of DNA (e.g., about 5 mg to about 10 mg total DNA), the low threshold of DNA 
5 is the minimum effective dose (e.g., about 2 ug to about 10 ug total DNA), the high 
threshold of protein is the maximum tolerable dose of protein (e.g., about 1 mg total 
protein), the low threshold of protein is the minimum effective dose (e.g., about 2 ug 
total protein). Experiments performed in support of the present invention 
demonstrated the efficacy of dividing the total DNA dose among the polynucleotides 

10 of the polynucleotide component (e.g., Example 4). Further, experiments performed 
in support of the present invention (e.g., Example 4) demonstrated the efficacy of 
dividing the total polypeptide dose among the polypeptides comprising the 
polypeptide component. The total DNA and total protein are both typically above the 
low threshold values. 

15 In a preferred embodiment, the total amount of DNA in a given DNA 

immunization has a high threshold of less than or equal to about 10 mg total DNA and 
greater than or equal to 1 mg total DNA, and the total amount of protein in a given 
polypeptide boost has a high threshold of less than or equal to about 200 ug total 
protein product and greater than or equal to 10 ug of total protein. For example, in an 

20 embodiment using a polynucleotide component having two DNA molecules each 
encoding an immunogenic HIV polypeptide the dose of each DNA molecule per 
subject may be one milligram of each DNA molecule encoding an immunogenic HIV 
polypeptide, for a total of 2 mg for the two DNA molecules, or 0.5 mg of each DNA 
molecule encoding an immunogenic HIV polypeptide, for a total of 1 mg for the two 

25 DNA molecules. Dosing with the polypeptide component may be similarly varied, for 
example, using a polypeptide component having two immunogenic HIV polypeptides 
the dose of each polypeptide per subject may be 100 micrograms of each 
immunogenic HIV polypeptide, for a total of 200 ug for the two polypeptides, 50 
micrograms of each immunogenic HIV polypeptide, for a total of 100 ug for the two 

30 polypeptides, or 25 ug of each immunogenic HIV polypeptide, for a total of 50 ug for 
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the two polypeptides. As described above, more than two polypeptides may be 
included in the polypeptide component of the present invention. 

In further embodiments, the polynucleotide component of the present invention 
may comprise one or more gene delivery vectors comprising the polynucleotide(s) 
5 encoding immunogenic HIV polypeptide(s). The polypeptide component of the 
present invention may comprise an adjuvant in addition to the immunogenic 
polypeptide(s). The present invention also comprises a method for generating an 
immune response in a subject, the method comprising, administering the 
polynucleotide composition to the subject under conditions that are compatible with 

10 expression of the polynucleotide(s) encoding immunogenic HIV polypeptide(s) in the 
subject, and administering the polypeptide composition to the subject. The 
administering of polynucleotide and polypeptide compositions may be concurrent or 
sequentially. In a preferred embodiment immunization with a polynucleotide 
component precedes immunization with at least one polypeptide component. Further, 

15 a single prime may be followed by multiple boosts or a series of primes and boosts 
may be used. 

Exemplary envelope proteins, and coding sequences thereof, for use in the 
present invention include, but are not limited to, gpl20, gpl40, oligomeric gpl40, and 
gpl60, including mutated or modified forms thereof (e.g., deletion of the V2 loop, 

20 mutations in cleavage sites, or mutations in glycosylation sites). In one embodiment, 
HIV envelope polypeptides that have been modified to expose the region of their 
polypeptide that binds to the CCR5 receptor are useful in the practice of the present 
invention, as well as polynucleotide sequences encoding such polypeptides. From the 
perspective of humoral immunity, it is useful to generate an immune response that 

25 provides neutralization of primary isolates that utilize the CCR5 chemokine co- 
receptor, which is believed to be important for virus entry (Zhu, T., et al. (1993) 
Science 261:1 179-1 181; Fiore, J., et al. (1994) Virology; 204:297-303). These and 
other exemplary polynucleotide constructs (e.g., a variety of envelope protein coding 
sequences), methods of making the polynucleotide constructs, corresponding 

30 polypeptide products, and methods of making polypeptides useful for HIV 

immunization have been previously described, for example, in the following: PCT 
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International Publication Nos.: WO/00/39302; WO/00/39304; WO/02/04493; 
WO/03/004657; WO/03/004620; and WO/03/020876; US Patent No. 6,602,705; and 
US Published Patent Application Nos. 20030143248 , and 20020146683. 

Although described with reference to HW subtypes B and C as exemplary 
5 subtypes, the compositions and methods of the present invention are applicable to a 
wide variety of HrV subtypes, serotypes, or strains and immunogenic polypeptides 
encoded thereby, including but not limited to the following: HIV-1 subtypes, A 
through K, N and O, the identified CRFs (circulating recombinant forms), and HIV-2 
strains and its subtypes. See, e.g., Myers, et al., Los Alamos Database, Los Alamos 

10 National Laboratory, Los Alamos, New Mexico; Myers, et al., Human Retroviruses 
and Aids, 1990, Los Alamos, New Mexico: Los Alamos National Laboratory. 

Further modifications of Env include, but are not limited to, generating 
polynucleotides that encode Env polypeptides having mutations and/or deletions 
therein. For instance, some or all of hypervariable regions, VI, V2, V3, V4 and/or V5 

1 5 can be deleted or modified as described herein, particularly regions VI , V2, and V3. 
VI and V2 regions may mask CCR5 co-receptor binding sites. (See, e.g., Moulard, et 
al. (2002) Proc. Nat'l Acad. Sci 14:9405-9416). Accordingly, in certain embodiments, 
some or all of the variable loop regions are deleted, for example to expose potentially 
conserved neutralizing epitopes. Further, deglycosylation of N-linked sites are also 

20 potential targets for modification inasmuch as a high degree of glycosylation also 
serves to shield potential neutralizing epitopes on the surface of the protein. 
Additional optional modifications, used alone or in combination with variable region 
deletes and/or deglycosylation modification, include modifications (e.g., deletions) to 
the beta-sheet regions (e.g., as described in WO 00/39303), modifications of the leader 

25 sequence (e.g., addition of Kozak sequences and/or replacing the modified wild type 
leader with a native or sequence-modified tpa leader sequence) and/or modifications to 
protease cleavage sites (e.g., Chakrabarti, et al., (2002) J. Virol. 76(1 1):5357-5368 
describing a gpl40 Delta CFI containing deletions in the cleavage site, fusogenic 
domain of gp41, and spacing of heptad repeats 1 and 2 of gp41 that retained native 

30 antigenic conformational determinants as defined by binding to known monoclonal 
antibodies or CD4, oligomer formation, and virus neutralization in vitro). 
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Various combinations of these modifications can be employed to generate 
wild-type or synthetic polynucleotide sequences as described herein. 

Modification of the Env polypeptide coding sequences may result in (1) 
improved expression relative to the wild-type coding sequences in a number of 
5 mammalian cell lines (as well as other types of cell lines, including, but not limited to, 
insect cells), and/or (2) improved presentation of neutralizing epitopes. Similar Env 
polypeptide coding sequences can be obtained, modified and tested for improved 
expression from a variety of isolates. 

Any of the polynucleotides (e.g., expression cassettes) or polypeptides 

10 described herein (delivered by any of the methods described above) can also be used 
in combination with other DNA delivery systems and/or protein delivery systems. 
Non-limiting examples include co-administration of these molecules, for example, in 
prime-boost methods where one or more molecules are delivered in a "priming" step 
and, subsequently, one or more molecules are delivered in a "boosting" step. In 

1 5 certain embodiments, the delivery of one or more nucleic acid-containing 
compositions is followed by delivery of one or more nucleic acid-containing 
compositions and one or more polypeptide-containing compositions (e.g., 
polypeptides comprising HIV antigens). In other embodiments, multiple nucleic acid 
"primes" (of the same or different nucleic acid molecules) can be followed by multiple 

20 polypeptide "boosts" (of the same or different polypeptides). Other examples include 
multiple nucleic acid administrations and multiple polypeptide administrations. 

In any method involving co-administration, the various compositions can be 
delivered in any order. Thus, in embodiments including delivery of multiple different 
compositions or molecules, the nucleic acids need not be all delivered before the 

25 polypeptides. For example, the priming step may include delivery of one or more 

polypeptides and the boosting comprises delivery of one or more nucleic acids and/or 
one more polypeptides. Multiple polypeptide administrations can be followed by 
multiple nucleic acid administrations or polypeptide and nucleic acid administrations 
can be performed in any order. Thus, one or more or the nucleic acid molecules (e.g., 

30 expression cassettes) described herein and one or more of the polypeptides described 
herein can be co-administered in any order and via any administration routes. 
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Therefore, any combination of polynucleotides and polypeptides described herein can 
be used to elicit an immune reaction. 

In addition, following prime-boost regimes (such as those of the present 
invention described herein) may be beneficial to help reduce viral load in infected 
subjects, as well as possibly slow or prevent progression of HIV-related disease 
(relative to untreated subjects). 

Experimental 

Below are examples of specific embodiments for carrying out the present 
invention. The examples are offered for illustrative purposes only, and are not 
intended to limit the scope of the present invention in any way. 

Efforts have been made to ensure accuracy with respect to numbers used (e.g., 
amounts, temperatures, etc.), but some experimental error and deviation should, of 
course, be allowed for. 

Example 1 

Generation of Synthetic Expression Cassettes 
Generating Synthetic Polynucleotides 

The polynucleotide sequences used in the practice of the present invention are 
typically manipulated to maximize expression of their gene products in a desired host 
or host cell. Following here is some exemplary guidance concerning codon 
optimization and functional varients of HIV polypeptides. The order of the following 
steps may vary. 

First, the HIV-1 codon usage pattern may be modified so that the resulting 
nucleic acid coding sequence is comparable to codon usage found in highly expressed 
human genes. The HIV codon usage reflects a high content of the nucleotides A or T 
of the codon- trip let. The effect of the HIV-1 codon usage is a high AT content in the 
DNA sequence that results in a high AU content in the RNA and in a decreased 
translation ability and instability of the mRNA. In comparison, highly expressed 
human codons prefer the nucleotides G or C. Wild-type polynucleotide sequences 
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encoding polypeptides are typically modified to be comparable to codon usage found 
in highly expressed human genes. 

Second, for some genes variants are created (e.g., mutated forms of the wild- 
type polypeptide). In the following table (Table 2) mutations affecting the activity of 
5 several HIV genes are disclosed. 



Table 2 



Gene 


"Region" 


Exemplary Mutations 


Pol 


prot 


Att = Reduced activity by attenuation of Protease 
(Thr26Ser) (e.g., Konvalinka et al., 1995, J Virol 69: 
7180-86) 

Ina = Mutated Protease, nonfunctional enzyme 
(Asp25Ala)(e.g., Konvalinka et al., 1995, J Virol 69: 
7180-86) 




RT 


YM = Deletion of catalytic center (YMDD AP; SEQ ID 
NO:7) (e.g., Biochemistry, 1995, 34, 5351, Patel et. al.) 
WM = Deletion of primer grip region (WMGY_PI; SEQ 
ID NO:8) (e.g., J Biol Chem, 272, 17, 1 1 157, 
Palaniappan, et. al., 1997) 




RNase 


no direct mutations, RnaseH is affected by "WM" 
mutation in RT 




Integrase 


1) Mutation of HHCC domain, Cys40Ala (e.g., 

Wiskerchen et. al., 1995, J Virol, 69: 376). 

2.) Inactivation catalytic center, Asp64Ala, Aspl 16 Ala, 

Glul52Ala(e.g., Wiskerchen et. al., 1995, J Virol, 69: 

376). 

3) Inactivation of minimal DNA binding domain 
(MDBD), deletion of Trp235(e.g., Ishikawa et. al., 1999, 
J Virol, 73:4475). 

Constructs int.opt.mut.SF2 and int.opt.mut_C (South 
Africa TV1) both contain all these mutations (1,2, and 
3) 
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Gene "Region" 


Exemplary Mutations 


Env 


Mutations in cleavage site (e.g., Earl et al. (1990) PNAS 
USA 87:648-652; Earl et al. (1991)/. Virol. 65:31-41). 

Mutations in glycosylation site (e.g., GM mutants, for 
example, change Q residue in VI and/or V2 to N 
residue; may also be designated by residue altered in 
sequence) 

Deletions or modifications of the VI, V2, V3, V4 or V5 
regions or combinations thereof. (See e.g., US 6602705) 

Deletions or modifications of the /3-sheets regions. (See 
e.g., WO 00/39303) 


Tat 


Mutants of Tat in transactivation domain (e.g., Caputo et 
al., 1996, Gene Ther. 3:235), e.g., cys22 mutant 
(Cys22Gly), cys37 mutant (Cys37Ser), and double 
mutants 


Rev 


Mutations in Rev domains (e.g., Thomas et al., 1998, J 
Virol. 72:2935-44), e.g., mutation in RNA binding- 
nuclear localization ArgArg38,39AspLeu, mutations 
in activation domain LeuGlu78,79AspLeu = M10 


Nef 


Mutations of myristoylation signal and in 
oligomerization domain, for example: 

1. Single point mutation myristoylation signal: 
Gly-to-Ala 

2. Deletion of N-terminal first 18 (sub-type B, e.g., 
SF162) or 19 (sub-type C, e.g., South Africa clones) 
amino acids. 

(e.g., Peng and Robert-Guroff, 2001, Immunol Letters 
78: 195-200) 

Single point mutation oligomerization: 
(e.g., Liu et al., 2000, J Virol 74: 5310-19) 

Mutations affecting (1) infectivity (replication) of HIV- 
virions and/or (2) CD4 down regulation, (e.g., 
Lundquist et al. (2002) J Virol. 76(9):4625-33) 


Vif 


Mutations of Vif: 

e.g., Simon et al., 1999, J Virol 73:2675-81 



99 



PP21 100.001 
PATENT 



Gene "Region" 


Exemplary Mutations 


Vpr 


Mutations of Vpr: 

e.g., Singh et al., 2000, J Virol 74: 10650-57 


Vpu 


Mutations of Vpu: 

e.g., Tiganos et al., 1998, Virology 251: 96-107 



Exemplary polynucleotides comprising some of these mutations have been 
previously described ( see, e.g., PCT International Publication Nos.: WO/00/39302; 
WO/00/39303; WO/00/39304; WO/02/04493; WO/03/004657; WO/03/004620; and 
5 WO/03/020876). Reducing or eliminating the function of the associated gene 

products can be accomplished employing the teachings set forth in the above table, in 
view of the teachings of the present specification. 

In one aspect, the present invention comprises Env coding sequences that 
include, but are not limited to, polynucleotide sequences encoding the following HIV- 

10 encoded polypeptides: gpl60, gpl40, and gpl20 (see, e.g., U.S. Patent No. 5,792,459 
for a description of the HIV-1 S F2 ("SF2") Env polypeptide). The relationships 
between these polypeptides is shown schematically in Figure 3 (in the figure: the 
polypeptides are indicated as lines, the amino and carboxy termini are indicated on the 
gpl60 line; the open circle represents the oligomerization domain; the open square 

1 5 represents a transmembrane spanning domain (TM); and "c" represents the location of 
a cleavage site, in gpl40.mut the "X" indicates that the cleavage site has been mutated 
such that it no longer functions as a cleavage site). The polypeptide gpl60 includes 
the coding sequences for gpl20 and gp41. The polypeptide gp41 is comprised of 
several domains including an oligomerization domain (OD) and a transmembrane 

20 spanning domain (TM). In the native envelope, the oligomerization domain is 

required for the non-covalent association of three gp41 polypeptides to form a trimeric 
structure: through non-covalent interactions with the gp41 trimer (and itself), the 
gpl20 polypeptides are also organized in a trimeric structure. A cleavage site (or 
cleavage sites) exists approximately between the polypeptide sequences for gpl20 and 

25 the polypeptide sequences corresponding to gp41. This cleavage site(s) can be mutated 
to prevent cleavage at the site. The resulting gpl40 polypeptide corresponds to a 
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truncated form of gpl60 where the transmembrane spanning domain of gp41 has been 
deleted. This gpl40 polypeptide can exist in both monomeric and oligomeric (i.e. 
trimeric) forms by virtue of the presence of the oligomerization domain in the gp41 
moiety. In the situation where the cleavage site has been mutated to prevent cleavage 
5 and the transmembrane portion of gp41 has been deleted the resulting polypeptide 
product is designated "mutated" gpl40 (e.g., gpl40.mut). As will be apparent to those 
in the field, the cleavage site can be mutated in a variety of ways. (See, also, e.g., PCT 
International Publication Nos. WO 00/39302 and WO/02/04493). 

Wild-type HIV coding sequences (e.g., Gag, Env, Pol, tat, rev, nef, vpr, vpu, 

10 vif, etc.) can be selected from any known HIV isolate and these sequences 

manipulated to maximize expression of their gene products following the teachings of 
the present invention. The wild-type coding region maybe modified in one or more of 
the following ways: sequences encoding hypervariable regions of Env, particularly VI 
and/or V2 are deleted, and/or mutations are introduced into sequences, for example, 

15 encoding the cleavage site in Env to abrogate the enzymatic cleavage of oligomeric 
gpl40 into gpl20 monomers. (See, e.g., Earl et al. (1990) PNAS USA 87:648-652; 
Earl et al. (1991) J. Virol. 65:31-41). In yet other embodiments, hypervariable 
region(s) are deleted, N-glycosylation sites are removed and/or cleavage sites mutated. 
As discussed above, different mutations may be introduced into the coding sequences 

20 of different genes (see, e.g., Table 2). 

To create the synthetic coding sequences of the present invention the gene 
cassettes are designed to comprise the entire coding sequence of interest. Synthetic 
gene cassettes are constructed by oligonucleotide synthesis and PCR amplification to 
generate gene fragments. Primers are chosen to provide convenient restriction sites 

25 for subcloning. The resulting fragments are then ligated to create the entire desired 
sequence which is then cloned into an appropriate vector. The final synthetic 
sequences are (i) screened by restriction endonuclease digestion and analysis,(ii) 
subjected to DNA sequencing in order to confirm that the desired sequence has been 
obtained and (iii) the identity and integrity of the expressed protein confirmed by 

30 SDS-PAGE and Western blotting. The synthetic coding sequences are assembled at 
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Chiron Corp. (Emeryville, CA) or by the Midland Certified Reagent Company 
(Midland, Texas). 

Percent identity to the synthetic sequences of the present invention can be 
determined, for example, using the Smith- Waterman search algorithm (Time Logic, 
5 Incline Village, NV), with the following exemplary parameters: weight matrix = 

nuc4x4hb; gap opening penalty = 20, gap extension penalty = 5, reporting threshold = 
1 ; alignment threshold = 20. 

Various forms of the different embodiments of the present invention (e.g., 
constructs) may be combined. 
10 Some exemplary embodiments of synthetic polynucleotides useful in the 

practice of the present invention are discussed in Example 4 and presented in Figure 6 
to Figure 19. 



B. Creating Expression Cassettes Comprising the Synthetic Polynucleotides of the 

15 Present Invention 

The synthetic DNA fragments of the present invention may be cloned into a 
number of viral or non-viral expression vectors. For example, polynucleotides used in 
the practice of the present invention may be cloned into the following non- viral 
expression vectors: (i) pCMVKm2, for transient expression assays and DNA 

20 immunization studies, the pCMVKm2 vector was derived from pCMV6a (Chapman et 
al., Nuc. Acids Res. (1991) 19:3979-3986) and comprises a kanamycin selectable 
marker, a ColEl origin of replication, a CMV promoter enhancer and Intron A, 
followed by an insertion site for the synthetic sequences described below followed by 
a polyadenylation signal derived from bovine growth hormone ~ the pCMVKm2 

25 vector differs from the pCMV-link vector only in that a polylinker site was inserted 
into pCMVKm2 to generate pCMV-link; (ii) pESN2dhfr and pCMVPLEdhfr (also 
known as pCMVIII), for expression in Chinese Hamster Ovary (CHO) cells; and, (iii) 
pAcC13, a shuttle vector for use in the Baculovirus expression system (pAcC13, was 
derived from pAcC12 which was described by Munemitsu S., et al., Mol Cell Biol. 

30 10(1 1):5977-5982, 1990). See, also PCT International Publication Nos. WO 
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00/39302, WO 00/39303, WO 00/39304, WO 02/04493 for a description of these 
vectors. 

Briefly, construction of pCMVPLEdhfr (pCMVIII) was as follows. To 
construct a DHFR cassette, the EMCV IRES (internal ribosome entry site) leader was 
5 PCR-amplified from pCite-4a+ (Novagen, Inc., Milwaukee, WI) and inserted into 
pET-23d (Novagen, Inc., Milwaukee, WI) as an Xba-Nco fragment to give pET- 
EMCV. The dhfr gene was PCR-amplified from pESN2dhfr to give a product with a 
Gly-Gly-Gly-Ser spacer in place of the translation stop codon and inserted as an Nco- 
BamRX fragment to give pET-E-DHFR. Next, the attenuated neo gene was PCR 

10 amplified from a pSV2Neo (Clontech, Palo Alto, CA) derivative and inserted into the 
unique BamRl site of pET-E-DHFR to give pET-E-DHFR/Neo (m 2). Then, the bovine 
growth hormone terminator from pCDNA3 (Invitrogen, Inc., Carlsbad, CA) was 
inserted downstream of the neo gene to give pET-E-DHFR/Neo (m 2)BGHt. The 
EMCV -dhfr/ neo selectable marker cassette fragment was prepared by cleavage of 

1 5 pET-E-DHFR/NeO( m 2)BGHt. The CMV enhancer/promoter plus Intron A was 

transferred from pCMV6a (Chapman et al., Nuc. Acids Res. (1991) 19:3979-3986) as a 
Hindlll-Sall fragment into pUC19 (New England Biolabs, Inc., Beverly, MA). The 
vector backbone of pUC19 was deleted from the Ndel to the Sapl sites. The above 
described DHFR cassette was added to the construct such that the EMCV IRES 

20 followed the CMV promoter to produce the final construct. The vector also contained 
an amp r gene and an SV40 origin of replication. 

Expression vectors of the present invention may comprise one or more 
polynucleotide sequence encoding immunogenic polypeptides. When the expression 
cassette contains more than one coding sequence the coding sequences may all be in- 

25 frame to generate one polyprotein; alternatively, the more than one polypeptide coding 
sequences may comprise a polycistronic message where, for example, an IRES is 
placed 5' to each polypeptide coding sequence; further, multiple promoters may be 
present to direct the expression of multiple coding sequences. 
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Example 2 

In Vivo Immunogenicity in Mice of Synthetic HIV Expression Cassettes and 
Polypeptides Encoded Thereby 

5 A. Immunization 

To evaluate the immunogenicity of the compositions of the present invention 
used to induce immune response, a mouse study may be performed. The 
polynucleotide component (e.g., two pCMVlink-based plasmids each carrying codon 
optimized coding sequences for gpl40 with delV2, the first coding sequence derived 

10 from SF162, subtype B, and the second coding sequence derived from TV1, subtype 
C), is diluted in a total injection volume of 100 ul using varying doses of DNA (0.02 - 
200ug). To overcome possible negative dilution effects of the diluted DNA, the total 
DNA concentration in each sample can be adjusted using the vector (e.g., pCMVlink) 
alone. Groups of 10-12 Balb/c mice (Charles River, Boston, MA) are intramuscularly 

15 immunized (50 ul per leg, intramuscular injection into the tibialis anterior) using 
varying dosages. 

The mice are then immunized with oligomeric, codon optimized, gpl40 with 
delV2, derived from SF162, subtype B, polypeptide at intervals following the DNA 
immunization using appropriate concentrations of polypeptide. 

20 

B. Humoral Immune Response 

The humoral immune response is checked with a suitable anti-HIV antibody 
ELISAs (enzyme-linked immunosorbent assays) of the mice sera 0 and 2-4 week 
intervals post immunization. 

25 The antibody titers of the sera are determined by anti-HIV antibody ELISA. 

Briefly, sera from immunized mice are screened for antibodies directed against HIV 
envelope protein. ELISA microtiter plates are coated with 0.2 ug of HIV envelope 
gpl40 protein per well overnight and washed four times; subsequently, blocking is 
done with PBS-0.2% Tween (Sigma) for 2 hours. After removal of the blocking 

30 solution, 100 ul of diluted mouse serum is added. Sera are tested at 1/25 dilutions and 
by serial 3 -fold dilutions, thereafter. Microtiter plates are washed four times and 
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incubated with a secondary, peroxidase-coupled anti-mouse IgG antibody (Pierce, 
Rockford, IL). ELISA plates are washed and 100 ul of 3, 3', 5, 5'-tetramethyl 
benzidine (TMB; Pierce) is added per well. The optical density of each well is 
measured after 15 minutes. The titers reported are the reciprocal of the dilution of 
5 serum that gave a half-maximum optical density (O.D.). 

The results of these assays are used to show the potency of the 
polynucleotide/polypeptide immunization methods of the present invention for the 
generation of an immune response in mice. 

10 C. Cellular Immune Response 

The frequency of specific cytotoxic T-lymphocytes (CTL) is evaluated by a 
standard chromium release assay of peptide pulsed Balb/c mouse CD4 cells. HIV 
protein-expressing vaccinia virus infected CD-8 cells may be used as a positive 
control (vv-protein). Briefly, spleen cells (Effector cells, E) are obtained from the 

15 BALB/c mice (immunized as described above). The cells are cultured, restimulated, 
and assayed for CTL activity against, e.g., Envelope peptide-pulsed target cells (see, 
e.g., Doe, B., and Walker, CM., AIDS 10(7):793-794, 1996, for a general description 
of the assay). Cytotoxic activity is measured in a standard 51 Cr release assay. Target 
(T) cells are cultured with effector (E) cells at various E:T ratios for 4 hours and the 

20 average cpm from duplicate wells is used to calculate percent specific 51 Cr release. 
Antigen specific T cell responses in immunized mice can also be measured by flow 
cytometric determinations of intracellular cytokine production (Cytokine flow 
Cytometry or "CFC") as described by zur Megede, J., et al.„ in Expression and 
immunogenicity of sequence-modified human immunodeficiency virus type 1 subtype 

25 B pol and gagpol DNA vaccines, J Virol. 77:6197-207 (2003). 

Cytotoxic T-cell (CTL) or CFC activity is measured in splenocytes recovered 
from the mice immunized with HIV DNA constructs and polypeptides as described 
herein. Effector cells from the immunized animals typically exhibit specific lysis of 
HIV peptide-pulsed SV-BALB (MHC matched) targets cells indicative of a CTL 

30 response. Target cells that are peptide-pulsed and derived from an MHC-unmatched 
mouse strain (MC57) are not lysed. The results of the CTL or CFC assays are used to 
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show the potency of the polynucleotide/polypeptide immunization methods of the 
present invention for induction of cytotoxic T-lymphocyte (CTL) responses by DNA 
immunization. 

5 Example 3 

In Vivo Immunogenicity Studies 
A. General Immunization Methods 

To evaluate the immune response generated using the compositions 
(comprising a polynucleotide component and a polypeptide component) and methods 

10 of the present invention, studies using guinea pigs, rabbits, mice, rhesus macaques 
and/or baboons may be performed. The studies are typically structured as shown in 
the following table (Table 3) and can be carried out using, for example, the following 
components: subtype B DNA pCMVlink carrying a codon optimized coding 
sequences for gpl40 with delV2, the coding sequence derived from SF162, subtype B; 

15 subtype C DNA -- pCMVlink carrying a codon optimized coding sequences for gpl40 
with delV2, the coding sequence derived from TV1, subtype C; subtype B protein ~ 
oligomeric, codon optimized, gpl40 with delV2, derived from SF162, subtype B 
polypeptide; and subtype C protein -- oligomeric, codon optimized, gpl40 with delV2, 
derived from TV1, subtype C polypeptide. 

20 Table 3 



DNA 
Immunization 


Protein Immunization 


Subtype 
B 


Subtype 

C 


Subtype B&C 
(IX) 


Subtype B&C 
(2X) 


Subtype B 


X 


X 


X 


X 


Subtype C 


X 


X 


X 


X 


Subtype B & C 
(IX) 


X 


X 


X 


X 


Subtype B & C 
(2X) 


X 


X 


X 


X 



The immunizations may use single or multiple DNA immunizations and single 
or multiple protein immunizations. The immunizations in the above table exemplify 
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the following methods: prime/boost regimens (Subtype B DN A/Sub type B protein; 
Subtype C DNA/Subtype C protein); mixed prime/boost, single DNA prime and single 
-protein boost (Subtype B DNA/Subtype C protein; Subtype C DNA/Subtype B 
protein); mixed DNA prime single protein boost (Subtype B & C DNA/Subtype B 
5 protein; Subtype B & C DNA/Subtype C protein); single DNA prime mixed protein 
boost (Subtype B DNA/Subtype B & C protein; Subtype C DNA/Subtype B & C 
protein); and mixed DNA prime mixed protein boost (Subtype B& C DNA/Subtype B 
& C protein. The amount of each DNA and /or protein in the mixed samples (i.e, B & 
C, in this example) can be added at an amount equal to that delivered in the single 
10 imuunizations (such that 2X the amount of total DNA and/or protein is delivered) or 
the amount of each DNA and/or protein in the mixed samples can be adjusted so that 
the same total amount (IX) of DNA and/or protein is delivered in the mixed and 
single samples. 

In addition to examples in Table 3 exemplifying combinations of 
1 5 polynucleotide component and polypeptide component, other combinations 

exemplifying two polynucleotide or two polypeptide components can be mentioned. 
For example, continuing the above example using combinations of HIV subtype B and 
subtype C immunogens, the present invention also includes single DNA prime and 
single DNA boost (Subtype B DNA/Subtype C DNA); single protein prime and single 
20 protein boost (Subtype B protein/Subtype C protein). 

B. Mice 

Experiments may be performed in mice following the immunization protocol 
illustrated in Table 3 and using the methods essentially as described in Example 2. 

25 

C. Guinea Pigs 

Experiments may be performed in guinea pigs as follows. Groups comprising 
six guinea pigs each are immunized parenterally (e.g., intramuscularly or 
intradermally) or mucosally at 0, 4, and 12 weeks with plasmid DNAs comprising 
30 expression cassettes comprising one or more HIV immunogenic polypeptide (for 

example, gp 140 DNAs as descibed in Example 2) as illustrated in Table 3. A subset 
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of the animals are subsequently boosted at approximately 12-24 weeks with a single 
dose (intramuscular, intradermally or mucosally) of the HIV protein(s) (for example, 
gp 140 DNAs as descibed in Example 2) as illustrated in Table 3. Animals may be 
boosted subsequently multiple times at 8-16 week intervals with the HIV protein. 
5 Antibody titers (geometric mean titers) are measured at two weeks following the third 
DNA immunization and at two weeks after the protein boost. Results of these studies 
are used to demonstrate the usefulness of the compositions and methods of the 
invention to generate immune responses, in particular to generate broad and potent 
neutralizing activity against diverse HIV strains. 

10 

D. Rabbits 

Experiments may be performed in rabbits as follows. Rabbits are immunized 
intramuscularly or intradermally at multiple sites (using needle injection with or 
without subsequent electroporation, or using a Bioject needless syringe) or mucosally 

1 5 with with plasmid DNAs comprising expression cassettes comprising one or more 

HIV immunogenic polypeptide (for example, gp 140 DNAs as descibed in Example 2) 
as illustrated in Table 3. A subset of the animals are subsequently boosted with a 
single dose (intramuscular, intradermally or mucosally) of the HIV protein(s) (for 
example, gp 140 DNAs as descibed in Example 2) as illustrated in Table 3. Animals 

20 may be boosted multiple times with the HIV protein. Typically, the compositions of 
the present invention used to generate immune responses are highly immunogenic and 
generate substantial antigen binding antibody responses after only 2 immunizations in 
rabbits. Results of these studies are used to demonstrate the usefulness of the 
compositions and methods of the invention to generate immune responses, in 

25 particular to generate broad and potent neutralizing activity against diverse HIV 
strains. 

E. Rhesus Macaques 

Experiments may be performed in rhesus macaques as follows. Rhesus 
30 macaques are immunized at approximately 0, 4, 8, and 24 weeks parenterally or 
mucosally with plasmid DNAs comprising expression cassettes comprising one or 
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more HIV immunogenic polypeptide (for example, gp 140 DNAs as descibed in 
Example 2) as illustrated in Table 3. Enhanced DNA delivery systems such as use of 
DNA complexed to PLG microparticles or saline injection of DNA followed by 
electropoartion can be employed to increase immune response during the DNA 
5 priming phase of the immunization regimen. A subset of the animals are subsequently 
boosted with a single dose (intramuscular, intradermally or mucosally) of the HIV 
protein(s) (for example, gp 140 DNAs as descibed in Example 2) as illustrated in 
Table 3. Animals may be boosted multiple times generally at 3-6 month intervals with 
the HIV protein. Typically, the macaques have detectable HIV-specific T-cell 

1 0 responses as measured by CTL assays or Cytokine Flow Cytometry after two or three 
1 mg doses of the polynucleotide component. Neutralizing antibodies may also 
detected. Results of these studies are used to demonstrate the usefulness of the 
compositions and methods of the invention to generate immune responses, in 
particular to generate broad and potent neutralizing activity against diverse HIV 

15 strains. 

F. Baboons 

Baboons are immunized 4 times (at approximately weeks 0, 4, 8, and 24) 
intramuscular, or intradermally, or mucosally with plasmid DNAs comprising 

20 expression cassettes comprising one or more HIV immunogenic polypeptide (for 

example, gp 140 DNAs as descibed in Example 2) as illustrated in Table 3. The DNAs 
can be delivered in saline with or without electroporation, or on PLG microparticles. A 
subset of the animals are subsequently boosted with a single dose (intramuscular, 
intradermally or mucosally) of the HIV protein(s) (for example, gp 140 DNAs as 

25 descibed in Example 2) as illustrated in Table 3. Animals may be boosted multiple 
times generally at 3-6 month intervals with the HIV protein. 

The animals are bled two-four weeks after each immunization and an HIV 
antibody ELIS A is performed with isolated plasma. The ELIS A is performed 
essentially as described below in Section G except the second antibody-conjugate is 

30 typically an anti-human IgG, g-chain specific, peroxidase conjugate (Sigma Chemical 
Co., St. Louis, MD 63178) used at a dilution of 1:500. Fifty ug/ml yeast extract may 
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be added to the dilutions of plasma samples and antibody conjugate to reduce non- 
specific background due to preexisting yeast antibodies in the baboons. 
Lymphoproliferative responses to are typically observed in baboons post-boosting 
with HIV-polypeptide Such proliferation results are indicative of induction of T-helper 
5 cell functions. Results of these studies are used to demonstrate the usefulness of the 
compositions and methods of the invention to generate immune responses, in 
particular to generate broad and potent neutralizing activity against diverse HIV 
strains. 

10 G. Humoral Immune Response 

In any immunized animal model (including the above, as well as, for example, 
chimpanzees), the humoral immune response is checked in serum specimens from the 
immunized animals with an anti-HIV antibody ELISAs (enzyme-linked 
immunosorbent assays) at various times post-immunization. The antibody titers of the 

15 sera are determined by anti-HIV antibody ELISA as described above. Briefly, sera 
from immunized animals are screened for antibodies directed against the HIV 
polypeptide/protein(s) encoded by the DNA and/or polypeptide used to immunize the 
animals (e.g., oligomeric gpl40). Typically independent ELISA assays are carried out 
using polypeptides corresponding to each of the subtypes used in the immunization 

20 study. 

Wells of ELISA microtiter plates are coated overnight with the selected HIV 
polypeptide/protein and washed four times; subsequently, blocking is done with PBS- 
0.2% Tween (Sigma) for 2 hours. After removal of the blocking solution, 100 ul of 
diluted mouse serum is added. Sera are tested at 1/25 dilutions and by serial 3-fold 

25 dilutions, thereafter. Microtiter plates are washed four times and incubated with a 
secondary, peroxidase-coupled anti-mouse IgG antibody (Pierce, Rockford, IL). 
ELISA plates are washed and 100 ul of 3, 3', 5, 5'-tetramethyl benzidine (TMB; 
Pierce) was added per well. The optical density of each well is measured after 15 
minutes. Titers are typically reported as the reciprocal of the dilution of serum that 

30 gave a half-maximum optical density (O.D.). 

Cellular immune responses may also be evaluated. 
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The presence of neutralizing antibodies in the sera is determined essentially as 
follows: Virus neutralization is measured in 5.25.EGFP.Luc.M7 (M7-luc) cells 
obtained from Dr. Nathaniel Landau (Salk Institute, San Diego, CA). The format of 
this assay is essentially the same as the MT-2 assay as described elsewhere 
5 (Montefiori et al. (1 988) J. Clin Microbiol. 26:23 1 -235) except that virus infection is 
quantified by luciferase reporter gene expression using a commercial luciferase kit 
(Promega). All serum samples are heat-inactivated for 1 hour at 56°C prior to assay. 
The virus stocks of the HIV-1 isolates are typically generated in PBMC. 

10 Example 4 

Evaluation Of Immunogenicity Regimens For Various HIV Polypeptide Encoding 
Plasmids Used As Primes And Various HIV Polypeptides Used As Boosts 
To evaluate the combination effects of subtype C (TV1) and subtype B 
(SF162) pgl40dV2 DNAs and proteins for DNA prime/boost the following 

15 experiments were carried out in rabbits. DNA was gpl40mod.TVl.dV2 and 

gpl40mod.SF162.dV2, delivered separately in two plasmids (sources of DNA are 
described further herein below). Protein was oligomer o-gpl40.dV2.TVl and o- 
gpl40.dV2.SF162 (sources of the proteins are described further herein below). DNA 
constructs were used for immunization in three doses at schedules of 0, 4, 12 weeks. 

20 Proteins were boosted at 12, 24, and 41 weeks. Each rabbit was injected 1 .0 ml DNA 
mixture at two sides EVI/Quadriceps, followed by an electroporation procedure (G. 
Widera, Increased DNA vaccine delivery and immunogenicity by electroporation in 
vivo, J. Immunology, 164, 4635-4640 (2000)). MF59 adjuvanted protein was injected 
two sites, IM/Glut for 1ml per animal. 

25 All of the genes were sequence-modified to enhance expression of the encoded 

Env glycoproteins in a Rev-independent fashion and they were subsequently cloned 
into pCMV-based plasmid vectors for DNA vaccine and protein production 
applications as described above. The sequences were codon optimized as described 
herein. Briefly, all the modified envelope genes were cloned into the Chiron 

30 pCMVlink plasmid vector, preferably into EcoRI/XhoI sites. 
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To obtain gpl40 polypeptides each of the gpl40 contracts (i.e., 
gpl40mod.TVl.mut7.delV2 and gpl40.mut7.modSF162.delV2) were used in the 
following method. 

Chinese hamster ovary (CHO) cells were transfected with plasmid DNA 
5 encoding the gpl40 proteins (e.g., pCMV vector backbone) using Minis TransIT-LTl 
polyamine transfection reagent (Minis Corporation, Madison WI) according to the 
manufacturer's instructions and incubated for 96 hours. After 96 hours, media was 
changed to selective media (F12 special with 250 ug/ml G418) and cells were split 1 :5 
and incubated for an additional 48 hours. Media was changed every 5-7 days until 

10 colonies started forming at which time the colonies were picked, plated into 96 well 
plates and screened by gpl20 Capture ELISA. Positive clones were expanded in 24 
well plates and screened several times for Env protein production by Capture ELISA, 
as described above. After reaching confluency in 24 well plates, positive clones were 
expanded to T25 flasks (Corning, Corning, NY). These were screened several times 

1 5 after confluency and positive clones were expanded to T75 flasks. 

Positive T75 clones were frozen in liquid nitrogen and the highest expressing 
clones amplified with 0-5 uM methotrexate (MTX)at several concentrations and 
plated in 100 mm culture dishes. Plates were screened for colony formation and all 
positive closed were again expanded as described above. Clones were expanded, 

20 amplified and screened at each step by gpl20 capture ELISA. Positive clones were 
frozen at each methotrexate level. Highest producing clones were grown in perfusion 
bioreactors (3L, 100L) for expansion and adaptation to low serum suspension culture 
conditions for scale-up to larger bioreactors. 

The stably transfected CHO cell lines, which express the Env polypeptides, 

25 were used to produce gpl40 proteins. The proteins were purified, briefly, by using a 
three-step strategy as previously described (Srivastava, et al., Purification and 
characterization of oligomeric envelope glycoprotein from a primary r5 subtype B 
human immunodeficiency virus. J Virol 76:2835-47 (2002)). First, concentrated cell 
supernatants were passed over a Galanthus Nivalis-agarose column (GNA; Vector 

30 Laboratories, Burlingame, CA). The gpl40SF162AV2 protein bound to the column, 
and most contaminating proteins flowed through. The bound protein was eluted with 
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500 mM methyl mannose pyranoside (MMP). Next, the captured protein was passed 
over DEAE and CHAP columns. 

These methods are applicable to other HIV genes and proteins derived from 
other HIV subtypes. Further, although this analysis was carried out in rabbits similar 
5 analysis may be carried out with other type of animals, for example, as described in 
Example 3. The immunization weeks can be varied. 

The following table (Table 4) lists exemplified procedures used in a 
comparison of the immunogenicity of subtype B and C polynucleotides encoding 
envelope polypeptides (in a pCMVlink vector) in various combinations with subtype 
10 B and C envelope polypeptides, both individually and as a mixed-subtype vaccine, 

using electroporation, in rabbits. It will be apparent to one skilled in the art in view of 
the teachings of the present specification that such methods are equally applicable to 
any other polynucleotides encoding immunogenic HIV polypeptides and 
immunogenic HIV polypeptides. 

15 

Table 4 



Gro 
up 


Animal 

# 


Imm'n 

# 


Adjuvant 


Immunogen 


Total 
Dose 


Vol/ 
Site 


Sites/ 
Animal 


Route 


1 


1-4 


1,2, 3, 
4 


MF59C 


o-gpl40 dV2 SF162 


50ug 


500ul 


2 


IM/Glut 
(Needle) 


2 


5-8 


1,2,3, 
4 


Iscomatrix 


o-gpl40dV2 SF162 


50ug 


500ul 


2 


IM/Glut 
(Needle) 


3 


9-12 


1,2,3 




pCMV 140 dV2 SF162 
DNA 


l.Omg 


0.50ml 


2 


IM/Quad 
(Needle) 






3,4 


MF59C 


o-gpl40dV2 SF162 


50ug 


500ul 


2 


IM/Glut 
(Needle) 


4 


13 -16 


1,2,3 




pCMV 140 dV2 SF162 
DNA 


l.Omg 


0.5ml 


2 


IM/Quad 
(Needle) 






3,4 


Iscomatrix 


o-gpl40 dV2 SF162 


50ug 


500ul 


2 


IM/Glut 
(Needle) 


5 


17-20 


1,2,3,4 


' MF59C 


o-gpl40dV2TVl 


50ug 


500ul 


2 


IM/Glut 
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Gro 
up 


Animal 

# 


Imm'n 

# 


Adjuvant 


Immunogen 


Total 
Dose 


V 1/ 
Site 


Sites/ 
Animal 


Route 


















(Needle) 


6 


21-24 


1,2,3 




pCMV 140 dV2 TV1 
DNA 


l.Omg 


0.5ml 


2 


IM/Quad 
(Needle) 






3,4 


MF59C 


o-gpl40dV2 SF162 


50ug 


500ul 


2 


IM/Glut 
(Needle) 


7 


25-28 


1,2,3 
3,4 


MF59C 


pCMV 140 dV2 SF162 
DNA 

pCMV 140dV2TVl 
DNA 

o-gpl40 dV2 SF162 


2.0mg 
(l.Omg 
ea.) 

50ug 


0.5ml 
500ul 


2 
2 


IM/Quad 
(Needle) 

IM/Glut 
(Needle) 


8 


29-32 


1,2,3 
3,4 


MF59C 


pCMV 140 dV2 SF162 
DNA 

pCMV 140 dV2 TV1 
DNA 

o-gpl40 dV2 TV1 


2.0mg 
50ug 


0.5ml 
500ul 


2 
2 


IM/Quad 
(Needle) 

IM/Glut 
(Needle) 


9 


33-36 


1,2,3 
3,4 


MF59C 


pCMV 140 dV2 SF162 
DNA 

pCMV 140dV2TVl 
DNA 

o-gpl40 dV2 SF162 
o-gpl40dV2 TV1 


2.0mg 
lOOug 


0.50ml 
500ul 


2 
2 


IM/Quad 
(Needle) 

IM/Glut 
(Needle) 


10 


37-40 


1,2,3 
3,4 


MF59C 


pCMV 140 dV2 SF162 
DNA 

pCMV 140 dV2 TV1 
DNA 

o-gpl40 dV2 SF162 
o-gpl40 dV2 TV1 


2.0mg 
50ug 


0.5ml 
500ul 


2 
2 


IM/Quad 
(Needle) 

IM/Glut 
(Needle) 


11 


41-44 


1,2,3 
3,4 


MF59C 


pCMV 140 dV2 SF162 
DNA 

pCMV 140 dV2 TV1 
DNA 

o-gpl40dV2SF162 


l.Omg 
50ug 


0.50ml 
500ul 


2 
2 


IM/Quad 
(Needle) 

IM/Glut 
(Needle) 



The MF59C adjuvant is a microfluidized emulsion containing 5% squalene, 
0.5% Tween 80, 0.5% Span 85, in lOmM citrate pH 6, stored in 10 ml aliquots at 4°C. 
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The Iscomatrix adjuvant is a quil saporin based adjuvant used for protein 
delivery (available from, e.g., CSL Limited,Victoria, Australia). 

The polynucleotides and polypeptides listed in Table 4 were prepared as 
described in Table 5. 
5 Table 5 



Polynucleotide Construct / 
Polypeptide 


Description 


pCMV 140 dV2 SF162 DNA 


The plasmid (pCMVlink) contained a synthetic, 
codon optimized HIV-1 gpl40 envelope gene 
from subtype B strain SF162 (see, 
gpl40.modSF162.delV2, Figure 6, see also 
PCT International Publication No. 
WO/00/39302). The gpl40 gene comprised the 
gpl20 and gp41 ectodomain. The constructs 
also contained a deletion in the variable region 
V2 (dV2).The plasmid construct contained the 
human CMV enhancer/promoter and 
Kanamycin resistance gene. Plasmids were 
prepared by alkaline lysis method and Qiagen 
purification from DH5--DnE.coli bacteria. 
Plasmids were stored at -80C until use. 


pCMV 140 dV2 TV1 DNA 


The plasmid (pCMVlink) contained a synthetic, 
codon optimized HIV-1 gpl40 envelope gene 
derived from HIV-1 subtype C strain TV1 (see, 
gpl40mod.TVl.delV2, Figure 8, see also PCT 
International Publication No. WO/02/04493). 
The structure of the envelope gene and the 
plasmid was as described above. 


o-gpl40 dV2 SF162 protein 


The subtype B oligomer protein contained five 
amino acid mutations in the cleavage site in 
addition to the deletion of V2 region (see, 
gpl40.mut7.modSF162.delV2, Figure 7, see 
also PCT International Publication No. 
WO/00/3930). Protein was expressed in CHO 
cells and purified from the CHO cells. 
Expression and purification of o-gpl40 proteins 
was described, for example, in PCT 
International Publication No. WO/00/39302 and 
Srivastava, et al., J Virol 76:2835-47 (2002). 


o-gpl40 dV2 TV1 protein 


The subtype C oligomer protein contained five 
amino acid mutations in the cleavage site in 
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Polynucleotide Construct / 
Polypeptide 


Description 




addition to the deletion of V2 region (see, 
gpl40mod.TVl.mut7.delV2, Figure 9, see also 
PCT International Publication No. 
WO/02/04493). Protein was expressed in CHO 
cells and purified from the CHO cells. 
Expression and purification of o-gpl40 proteins 
was described, for example, in PCT 
International Publication No. WO/00/39302 and 
Srivastava, et al., J Virol 76:2835-47 (2002). 



Immunogens were prepared as described in the following table (Table 6) for 
administration to animals in the various groups. 

Table 6 

5 



Group 


Preparation 


1,5 


Immunization 1-4: Protein Immunization + MF59 

Protein doses were 50ug protein per snimsl. Xhe initio! protein W3.S 
diluted to 0.100 mg/ml in citrate buffer. Stored at -80°C until use. 
Thawed at room temperature; material was clear with no particulate 
matter. Added equal volume of MF59C adjuvant to thawed protein and 
mixed well by inverting the tube. Immunized each rabbit with 0.5ml 
adjuvanted protein per side, IM/Glut for a total of 1ml per animal. Used 
material within 1 hour of the addition of adjuvant. Needles were used for 
injections. 


2 


Immunization 1-4: Protein Immunization + Iscomatrix 

The stock concentration was lmg/ml. Immediately before 
immunizations, 250ul of lmg/ml Iscomatrix was diluted to 2.5ml of 
0. lmg/ml with PBS (CFU U21). Added equal volume (2.5ml) of 
0.1 mg/ml Iscomatrix into 2.5ml of 0.1 mg/ml protein and mixed well. 
Immunized each rabbit with 0.5ml adjuvanted protein per side, EM/Glut 
for a total of 1ml per animal. 


3- 4,6 


Immunization 1—3: Subtype B/C plasmid DNA in Saline 

The immunogen was provided at 1.0 mg/ml total DNA in sterile 0.9% 
saline. Stored at -80°C until use. Thawed DNA at room temperature; the 
material was clear or slightly opaque, with no particulate matter. 
Immunized each rabbit with 0.5ml DNA mixture per side 
(EVI/Quadriceps), total 2 sides with 1.0ml per animal. Animals were 
shaved prior to immunization, under sedation of lx dose IP (by animal 
weight) of Ketamine-Xylazine (80mg/ml - 4mg/ml). DNA injection used 
needle. Following the DNA injection, electroporation was administrated 
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Group 


Preparation 


3,6 
4 


using a 6-needle circular array with 1cm diameter, 1cm needle length. 
Electroporation pulses were given at 20V/mm, 50ms pulse length, 1 
pulse/s. 

Immunization 3-4: Protein Immunization 

Protein doses were 50ug each SF162 protein per animal. The initial 
SF162 Protein was diluted to 0.100 mg/ml in citrate buffer. Stored at - 
80°C until use. Thawed at room temperature; material was clear with no 
particulate matter. Added equal volume of MF59C adjuvant to thawed 
protein and mixed well by inverting the tube. Immunized each rabbit 
with 0.5ml adjuvanted protein per side, IM/Glut for a total of 1ml per 
animal. Used material within 1 hour of the addition of adjuvant. Needles 
were used for injections. 

Immunization 3-4: Protein immunization 

The stock concentration was 1 mg/ml. Immediately before 
immunizations, Iscomatrix was diluted to O.lmg/ml with PBS (CFU 
U21). Added equal volume of O.lmg/ml Iscomatrix into the O.lmg/ml 
protein and mixed well. Immunized each rabbit with 0.5ml adjuvanted 
protein per side, IM/Glut for a total of 1ml per animal. 


7-8, 10 


Immunization 1--3: Subtype B/C plasmid DNA in Saline 

The immunogen was provided at 2.0mg/ml total DNA in sterile 0.9% 
saline. Stored at -80°C until use. Thawed DNA at room temperature; the 
material was clear or slightly opaque, with no particulate matter. 
Immunized each rabbit with 0.5ml DNA mixture per side 
(TM/Quadriceps), total 2 sides with 1.0ml per animal. Animals were 
shaved prior to immunization, under sedation of lx dose IP (by animal 
weight) of Ketamine-Xylazine (80mg/ml - 4mg/ml). DNA injection used 
needle. Following the DNA injection, electroporation was administrated 
using a 6-needle circular array with 1cm diameter, 1cm needle length. 
Electroporation pulses were given at 20V/mm, 50ms pulse length, 1 
pulse/s. 

Immunization 3-4: Protein Immunization 

Protein doses were 50ug protein per animal. The initial protein was 
diluted to 0.100 mg/ml in citrate buffer. Stored at -80°C until use. 
Thawed at room temperature; material was clear with no particulate 
matter. Added equal volume of MF59C adjuvant to thawed protein and 
mixed well by inverting the tube. Immunized each rabbit with 0.5ml 
adjuvanted protein per side, IM/Glut for a total of 1ml per animal. Used 
material within 1 hour of the addition of adjuvant. Needles were used for 
injections. 
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Group 


Preparation 


9 


Immunization 1-3: Subtype B plasmid DNA in Saline 

The immunogen was provided at 1 .Omg/ml total DNA in sterile 0.9% 
saline. Stored at -80°C until use. Thawed DNA at room temperature; the 
material was clear or slightly opaque, with no particulate matter. 
Immunized each rabbit with 0.5ml DNA mixture per side 
(IM/Quadriceps), total 2 sides with 1.0ml per animal. Animals were 
shaved prior to immunization, under sedation of lx dose IP (by animal 
weight) of Ketamine-Xylazine (80mg/ml - 4mg/ml). DNA injection used 
needle. Following the DNA injection, electroporation was administrated 
using a 6-needle circular array with 1cm diameter, 1cm needle length. 
Electroporation pulses were given at 20V/mm, 50ms pulse length, 1 
pulse/s. 

Immunization 3-4: Protein Immunization 

Protein doses were 50ug each protein per animal, total lOOug. The initial 
protein was diluted to 0.200 mg/ml in citrate buffer. Stored at -80°C until 
use. Thawed at room temperature; material was clear with no particulate 
matter. Added equal volume of MF59C adjuvant to thawed protein and 
mixed well by inverting the tube. Immunized each rabbit with 0.5ml 
adjuvanted protein per side, DVI/Glut for a total of 1ml per animal. Used 
material within 1 hour of the addition of adjuvant. Needles were used for 
injections. 


11 


Immunization 1-3: Subtype B plasmid DNA in Saline 

The immunogen was provided at 1. Omg/ml total DNA in sterile 0.9% 
saline. Stored at -80°C until use. Thawed DNA at room temperature; the 
material was clear or slightly opaque, with no particulate matter. 
Immunized each rabbit with 0.5ml DNA mixture per side 
(IM/Quadriceps), total 2 sides with 1.0ml per animal. Animals were 
shaved prior to immunization, under sedation of lx dose IP (by animal 
weight) of Ketamine-Xylazine (80mg/ml - 4mg/ml). DNA injection used 
needle. Following the DNA injection, electroporation was administrated 
using a 6-needle circular array with 1cm diameter, 1cm needle length. 
Electroporation pulses were given at 20V/mm, 50ms pulse length, 1 
pulse/s. 

Immunization 3-4: Protein Immunization 

Protein doses were 50ug protein per animal. The initial protein was 
diluted to 0.100 mg/ml in citrate buffer. Stored at -80°C until use. 
Thawed at room temperature; material was clear with no particulate 
matter. Added equal volume of MF59C adjuvant to thawed protein and 
mixed well by inverting the tube. Immunized each rabbit with 0.5ml 
adjuvanted protein per side, IM/Glut for a total of 1ml per animal. Used 
material within 1 hour of the addition of adjuvant. Needles were used for 
injections. 
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Group 


Preparation 







The immunization (Table 7) schedules were as follows: 
Table 7 



Imm'n: 
Weeks: 
Group 


1 

0 


2 
4 


3 

12 


4 

24 


1 


Gpl40dV2 
SF162 + MF59C 


Gpl40dV2SF162 
+ MF59C 


Gpl40dV2 SF162 
+ MF59C 


Gpl40 dV2 
SF162 + MF59C 


2 
3 


Gpl40 dV2 
SF162 + 
Iscomatrix 
pCMV 140 dV2 
SF162 DNA 


Gpl40 dV2 SF162 
+ Iscomatrix 

pCMV 140 dV2 
SF162 DNA 


Gpl40 dV2 SF162 
+ Iscomatrix 

pCMV 140 dV2 
SF162 DNA 
Gpl40dV2 SF162 
+ MF59C 


Gpl40dV2 
SF162 + 
Iscomatrix 
Gpl40 dV2 
SF162 + MF59C 


4 


pCMV 140 dV2 
SF162 DNA 


pCMV 140 dV2 
SF 162 DNA 


pCMV 140 dV2 
SF162 DNA 
Gpl40 dV2 SF162 
+ Iscomatrix 


Gpl40dV2 
SF162 + 
Iscomatrix 


5 


Gpl40 dV2 TV1 
+ MF59C 


Gpl40 dV2 TV1 + 
MF59C 


Gpl40dV2 TV1 + 
MF59C 


Gpl40dV2 TV1 
+ MF59C 


6 


PCMV 140 dV2 
TV1 DNA 


pCMVHO dV2 
TV1 DNA 


P CMV140dV2 
TV1 DNA 
Gpl40 dV2 TV1 + 
MF59C 


Gpl40 dV2 TV1 
+ MF59C 


7 


pCMV 140 dV2 
SF162 DNA + 
PCMV 140 dV2 
TV1DNA 


pCMV 140 dV2 
SF162 DNA + 
PCMV 140 dV2 
TV1 DNA 


pCMV 140 dV2 
SF162DNA + 
PCMV 140 dV2 
TV1 DNA 
Gpl40dV2 SF162 
+ MF59C 


Gpl40 dV2 
SF162 + MF59C 


8 


pCMV 140 dV2 
SF162 DNA + 
PCMV 140 dV2 
TV1 DNA 


pCMV 140 dV2 
SF162 DNA + 
PCMV 140 dV2 
TV1 DNA 


pCMV 140 dV2 
SF162DNA + 
PCMV 140 dV2 
TV1 DNA 
Gpl40dV2TVl + 
MF59C 


Gpl40 dV2 TV1 
+ MF59C 
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Imm n: 


1 


2 


3 


4 


Weeks: 


0 


4 


12 


24 


Group 










9 


pCMV 140 dV2 


pCMV 140 dV2 


pCMV 140 dV2 


Gpl40dV2 \ 




SF162 DNA + 


SF162DNA + 


SF162 DNA + 


SF162 + MF59C 




PCMV 140 dV2 


PCMV 140 dV2 


PCMV 140 dV2 


upi4U avz ivi 




TV1 DNA 


TV1 DNA 


TV1 DNA 


+ MF59C 








Gpl40dV2 SF162 


(lOOugProt.) 








+ MF59C 










Gpl40dV2 TV1 + 










MF59C(100ug 










Prot.) 




10 


pCMV 140 dV2 


pCMV 140 dV2 


pCMV 140 dV2 


Gpl40 dV2 




SF162 DNA + 


SF162DNA + 


SF162 DNA + 


SF162+MF59C 




PCMV 140 dV2 


PCMV 140 dV2 


PCMV 140 dV2 


Gp 140 d V2 TV 1 




TV1 DNA 


TV1 DNA 


TV1 DNA 


+ MF59C 








Gpl40dV2 SF162 


(50ug Prot.) 








+ MF59C 










Gpl40 dV2TVl + 










MF59C 










(50ug Prot.) 




11 


pCMV 140 dV2 


pCMV 140 dV2 


pCMV 140 dV2 


Gpl40 dV2 




SF162 DNA + 


SF162 DNA + 


SF162 DNA + 


SF162 + MF59C 




PCMV 140 dV2 


PCMV 140 dV2 


PCMV 140 dV2 






TV1 DNA 


TV1 DNA 


TV1 DNA (l.Omg) 










Gpl40 dV2 SF162 










+ MF59C 






Note: all DNA 


Note: all proteins 








was l.Omg each 


were 50ug each 








except group 11 


except group 10 








used 0.5mg DNA 


used 25ug ea. 








each. 









Table 7 (cont.) 



Imm'n: 


5 


6 


Weeks: 


41 


56 


Group 






1 


Gpl40 dV2 SF162 + 


Gpl40dV2 SF162 + 




MF59C 


MF59C 


2 


Gpl40dV2 SF162 + 


Gpl40dV2 SF162 + 




Iscomatrix 


Iscomatrix 


3 


Gpl40dV2 SF162 + 


Gpl40 dV2 SF162 + 




MF59C 


MF59C 


4 


Gpl40dV2 SF162 + 


Gpl40 dV2 SF162 + 




Iscomatrix 


Iscomatrix 


5 


Gpl40 dV2 TV1 + MF59C 


Gpl40 dV2 TV1 + MF59C 


6 


Gpl40dV2TVl +MF59C 


Gpl40 dV2 TV1 + MF59C 


7 


Gpl40dV2 SF162 + 


Gpl40dV2SF162 + 




MF59C 


MF59C 
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ImnVn- 


5 


6 


ee . 


41 


56 


Group 






8 


Gpl40 dV2 TV1 +MF59C 


Gpl40 dV2 TV1 +MF59C 


9 


Gpl40 dV2 SF162 + 


Gpl40dV2SF162 + 




MF59C 


MF59C 




Gpl40dV2TVl +MF59C 


Gpl40 dV2 TV1 +MF59C 




(lOOugProt.) 


(lOOugProt.) 


10 


Gpl40dV2 SF162 + 


Gpl40dV2SF162 + 




MF59C 


MF59C 




Gpl40 dV2 TV1 + MF59C 


Gpl40 dV2 TV1 + MF59C 




(50ug Prot.) 


(50ug Prot.) 


11 


Gpl40dV2 SF162 + 


Gpl40dV2 SF162 + 




MF59C 


MF59C 




Note: all DNA was l.Omg 


Note: all proteins were 




each except group 11 


50ug each except group 




used 0.5mg DNA each. 


10 used 25ug each. 



The bleeding (Table 8) schedules for all groups (A-F) were as follows: 
Table 8 



Bleed: 


0 


1 


2 


3 


4 


5 


6 


7 


Week: 


0 


2 


6 


8 


12 


14 


16 


24 


Sample: 


Clotted 


Clotted 


Clotted 


Clotted 


Clotted 


Clotted 


Clotted 


Clotted 


Bid. 


Bid. 


Bid. 


Bid. 


Bid. 


Bid. 


Bid. 


Bid. 




for Serum 


for Serum 


for Serum 


for Serum 


for Serum 


for Serum 


for Serum 


for Serum 


Bleed: 


8 


9 


10 


11 


12 


13 


14 


75 


Week: 


26 


28 


41 


43 


45 


56 


58 


60 


Sample: 


Clotted 


Clotted 


Clotted 


Clotted 


Clotted 


Clotted 


Clotted 


Clotted 


Bid. 


Bid. 


Bid. 


Bid. 


Bid. 


Bid. 


Bid. 


Bid. 




for Serum 


for Serum 


for Serum 


for Serum 


for Serum 


for Serum 


for Serum 


for Serum 



To evaluate the combination effects of subtype C (TV1) and subtype B 
(SF162) gpl40dV2 DNAs and proteins for DNA prime/boost on the generation of 
neutralizing antibody activity against HIV strain SF162 (type B) the following 

1 0 comparisons were carried out. 

Neutralizing antibody responses against PBMC-grown SF162 and TV1 HIV-1 
strains were monitored in the sera collected from the immunized rabbits using the 
following assay conducted essentially as follows. Virus neutralization was measured 
in 5.25.EGFP.Luc.M7 (M7-luc) cells obtained from Dr. Nathaniel Landau (Salk 

1 5 Institute, San Diego, CA). The format of this assay was essentially the same as the 
MT-2 assay that has been described elsewhere (Montefiori, et al., J. Clin Microbiol. 
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26:231-235, 1988) except that virus infection was quantified by luciferase reporter 
gene expression using a commercial luciferase kit (Promega). All serum samples were 
heat-inactivated for 1 hour at 56°C prior to assay. The virus stocks of the HIV-1 
islolates were generated in PBMC. Neutralizing antibody titers are reported as 
5 reciprocal serum dilution at which 50% luciferase activity was measured in test wells 
as compared to virus control wells. Values shown in Figures 4 and 5 are the geometric 
mean titers plus standard errors of the neutralization titers for each group of animals. 

The results of the assays for the presence of neutralizing antibodies are 
presented in Figure 4 and Figure 5. In the figures, the following Immunization Groups 
10 correspond to the Groups in Table 4: B DNA + B prot; C DNA + B prot (Group 6); 
B+C DNA + B prot (Group 7); B+C DNA + C prot (Group 8); B+C DNA & prot 
(Group 9); B+C DNA & prot (1/2) (Group 10); and, B+C DNA (1/2) + C prot (Group 
11). 

In Figure 4, the first vertical bar of each group of three bars is neutralizing 

15 activity against HrV-1 SF-162 in prebleed rabbit serum (Figure 4, Prebleed), the 
second vertical bar is serum from a bleed two weeks after the third immunization 
(Figure 4, 2 wk post 3 rd ), and the third vertical bar is serum from a bleed two weeks 
after the fourth immunization (Figure 4, 2 wk post 4 th ). 

Figure 4 summarizes data showing the neutralization titers against HIV-1 

20 SF162 between the 7 groups described above. These results demonstrated that all 

groups showed strong neutralizing activity against the HIV-1 SF162 isolate. Further, 
neutralizing activity significantly increased at post 4 th immunization compared to post 
3 rd immunizations. Priming and boosting with B gene and B protein (B DNA + B 
prot) showed a high titer, as did the C gene and B protein (C DNA + B prot). For the 

25 mixed (B+C) DNA prime and single protein boost, B protein gave a high boost to the 
mixed gene prime (B+C DNA + B prot) and a boost to the C protein (B+C DNA + C 
prot). For the mixed DNA prime and protein boost, half dose (50ug) of protein (B+C 
DNA & prot (1/2)) induced high neutralizing activity as did the full dose of lOOug 
protein (B+C DNA & prot). The mixed DNA prime and single protein boost with 

30 subtype C protein, the half- dose (lmg) DNA (B+C DNA + C prot) also gave 

neutralizing activity, as did the full-dose of 2mg DNA (B+C DNA (1/2) + C prot). 
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In Figure 5, the prebleed values for neutralizing activity against HIV-1 TV1 in 
prebleed rabbit serum were less than one log for each group of bars (Figure 5, 
Prebleed), the grey vertical bars for each group are serum from bleeds two weeks after 
the fourth immunization (Figure 5, 2 wk post 4 th ). 
5 Figure 5 summarizes data showing the neutralization titers against HIV-1 TV1 

(South African subtype C) between the 7 groups described above. These results 
demonstrated that all groups showed neutralizing activity against HF/l subtype C 
TV1 isolate (as expected, because no subtype C DNA or protein was used, the B DNA 
+ B protein showed the lowest neutralizing activity). For the mismatched a single 

10 DNA prime and a single protein boost (C DNA + B prot), priming with C gene and 
boosting with B protein showed a high titer, as did the B gene and B protein (B DNA 
+ B prot). For the mixed (B+C) DNA prime and single protein boost, use of either B 
(B+C DNA + B prot) and C (B+C DNA + C prot) proteins had a similar boosting 
effect. For the mixed DNA prime and protein boost, full dose of lOOug protein (B+C 

1 5 DNA & prot) induced high neutralizing activity, as did the half dose of 50ug protein 
(B+C DNA & prot (1/2)). The half-dose (lmg) DNA (B+C DNA (1/2) + C prot) also 
gave neutralizing activity, as did the full-dose of 2mg DNA (B+C DNA + C prot). 

Comparison of the data presented in Figure 4 and Figure 5 supported the 
combination methods of the present invention for generating an immune response in a 

20 subject. Such a comparison showed that the combination of DNA derived from 

different subtypes primed broad responses to multiple strains from different subtypes. 
This may indicate the targeting common conserved epitopes. Further, use of a single 
subtype protein was sufficient to boost broad neutralizing responses when immunity 
was primed with multiple strains from different subtypes of DNA. The DNA priming 

25 maintained the native envelope structure. This can induce T cell responses in addition 
to the B cell response. Finally, these results demonstrated that use of lower doses of 
proteins mixture can also provide strong immune responses. 

These studies demonstrated the usefulness of the compositions and methods of 
the invention to generate immune responses, in particular to generate broad and potent 

30 neutralizing activity against diverse HIV strains. 
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Example 5 

Immunogenicity Study of El - E3 Deleted, Replication Defective Ad-HPV 
Recombinant Versus E3 Deleted, Replication Competent Ad-HIV Recombinant 
The following experiments were carried out in chimpanzees. Chimpanzees 
5 with minimal Ad5- and Ad7-cross-reactive antibodies were selected for this 

experiment. Ad5 and Ad7 microtiter neutralization assays were performed essentially 
as previously described in Buge, et al., J. Virol. 71:8531-8541 (1997) and Lubeck, et 
al., Nature Med. 3:651-8 (1997). Chimpanzees were immunized according to the 
schedule in Table 9. Each group comprised 2 or 3 animals. 
10 Table 9 



Group 


Number 

of 
Chimps 


Week 0 (IN) 


Week 12 (IN) 


Week 36 
(IM) 


1 


3 


delAd5-E3-HIVgpl60 
10 7 pfu (replication 
competent) 


Ad7delE3- 

HlVgpl60 

10 7 pfu (replication 

competent) 


SF162 

o-gpl40V2 in 
MF59 


2 


2 


Ad5delE3-HIVgpl60 
10 8 pfu (replication 
competent) 


Ad7delE3- 

HIVgpl60 

10 8 pfu (replication 

competent) 


SF162 

o-gpl40V2 in 
MF59 


3 


2 


Ad5delEl/E3-HIVgpl60 
10 8 pfu (replication 
defective) 


Ad7delEl/E3- 

HIVgpl60 

10 8 pfu (replication 

defective) 


SF162 

o-gpl40V2 in 
MF59 


4 


3 


Ad5delEl/E3-HIVgpl60 
10 9 pfu (replication 
defective) 


Ad7delEl/E3- 

HIVgpl60 

10 9 pfu (replication 

defective) 


SF162 

o-gpl40V2 in 
MF59 



(IN = intranasal; DVI = intramuscular) 



The delAd5-E3, Ad7delE3, Ad5delEl/E3, and Ad7delEl/E3 vectors have been 
previously described (Nan X., et al., Development of an Ad7 cosmid system and 
generation of an Ad7deltaEldeltaE3HIV(MN) env/rev recombinant virus. Gene Ther. 
15 Feb; 10(4):326-36 (2003)). 

The Adeno-virus vectors (Ad recombinants) contained inserts derived from 
the HIV-1 subtype B prototype strain MN wherein the inserts encoded the gpl60 
envelope protein (see, e.g., GenBank Accession Ml 7449; Gurgo,C, et al., "Envelope 
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sequences of two new United States HIV-1 isolates," Virology 164(2); 531-6 (1988); 
Lori, F., et al., "Effect of reciprocal complementation of two defective human 
immunodeficiency virus type 1 (HIV-1) molecular clones on HIV-1 cell tropism and 
virulence," J. Virol. 66(9); 5553-60 (1992); Lukashov, V.V., et al., "Increasing 
5 genotypic and phenotypic selection from the original genomic RNA populations of 
HIV-1 strains LAI and MN (NM) by peripheral blood mononuclear cell culture, B- 
cell-line propagation and T-cell-line adaptation," AIDS 9(12); 1307-1 1 (1995). HTV-1 
MN is from one of the earliest available HIV-1 isolates, and is a commonly used 
reference and vaccine strain. 

10 The MN isolate was taken from a six year old male pediatric AIDS patient 

from the area of Newark, New Jersey, USA in 1984. His mother was an IV. drug user 
who died of pneumonia in 1982. His father was also HIV sero-positive. Other 
sequences from this patient from the 1984 blood sample and from a 1987 sample 
taken shortly before death (GenBank Accession U72495) are available. See also 

15 GenBank Accession L48364-L48379. The MN sequence was cloned from the isolate 
in lambda phage. The coding sequences for pol, nef and vpu are prematurely 
truncated; pol shows an in-frame stop codon at 3783, nef and vpu are prematurely 
truncated at position 9357 and position 6142 respectively. Another complete genome 
of the MN isolate is available with GenBank Accession number AF075719 and it too 

20 has defective genes; although not pol nor vpu. A set of V3 sequences from this 
isolate are available (GenBank Accession Accession numbers L48364-L48379; 
Lukashov, V. et al., AIDS 9:1307-131 1 (1995)). The isolate MN is available from the 
NIH AIDS Reagent program, and is X4. 

Ad-recombinant vectors (see Table 9) comprising HIV-1 MN gpl60 protein 

25 coding sequences were diluted in PBS and administered drop-wise into the nostrils, 1 
ml total volume, 500 ul per nostril. Antibiotics are administered for a total of 1 1 days, 
beginning 3 days prior to inoculation. 

The polypeptide component used for a protein boost comprised SF162 o- 
gpl40V2 protein. This protein is from the same HTV-1 subtype as the gpl60 coding 

30 sequences used in the polynucleotide component, which were derived from HIV-1 
MN. The SF162 o-gpl40V2 protein was prepared using CMV3 vector comprising the 
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gpl40.mut7.mod.SF162.delV2 sequence expressed in CHO cells followed by oligo- 
protein isolation essentially as previously described, for example, in PCT International 
Publication No. WO/00/39302. 

The protein boost was typically 100 ug of SF162 o-gpl40V2 per chimpanzee. 
5 The SF1 62 o-gp 1 40V2 protein was provided at 0.200 mg/ml in citrate buffer, stored at 
-80°C until use, and thawed at room temperature. The material as clear with no 
particulate 

matter. Equal volume of MF59C adjuvant was added. The mixture was stored at 4°C 
and mixed well by inverting the tube several times before use. 
10 Each animal was immunized with a total volume of 1 ml per animal (using 1 or 

2 IM sites per animal). Material was used within 1 hour of the addition of adjuvant. 

Blood, secretory samples, and stool specimens were collected. Typically for 
blood samples, a 10 ml bleed was obtained for serum and a 30 ml bleed for 
heparinized blood. 

15 The following assays were carried out on the collected samples. 

A. Binding assays for HIV envelope antibodies by ELISA on immunized 
chimpanzee serum. 

Standard HIV Env ELISA methods were employed in binding assays to detect 
HIV envelope antibodies in sera from chimpanzees immunized as just described. The 
20 methods were essentially as described by Buge, et al., J. Virol. 71:8531-8541 (1997) 
and Lubeck, et al, Nature Med. 3:651-8 (1997). Table 10 presents data for binding 
antibody titers to HIVIIIB gp 1 60. 



Table 10 



Replication 
competent Ad 


Dose 


Chimp 


At time of 
gpl40 
boost 


2 weeks 
later 


4 weeks 
later 




10 y 


4x0363 
(SW) 


275 


42,000 


10,400 






4x0271 
(SW) 


7,500 


21,800 


10,700 






A163 (NI) 


pending 




















10" 


4x0386 


100 


11,000 


11,700 
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(SW) 












182D (NI) 


100 


800 


410 














Replication 
incompetent Ad 


Dose 


Chimp 


At time of 
gpl40 
boost 


2 weeks 
later 


4 weeks 
later 




10 s 


4x0376 
(SW) 


10 


100 


100 






4x0360 
(SW) 


pending 




















10 y 


4x0373 
(SW) 


75 


33,000 


28,500 






87A003 (NI) 


10 


75 


10 






A136(NI) 


10 


590 


440 



Data for binding antibody titers to SF162 envelope protein was also evaluated 
and is shown in Figure 20. 



5 B. Neutralizing antibody assays against TCLA and primary HIV isolates. 

Virus neutralization against TCLA strains was measured in the MT-2 assay 
(Montefiori, et al., J. Clin Microbiol 26:231-235 (1988)). Virus neutralization against 
primary HIV-1 strains was measured in M7-luc cells obtained from Dr. Nathaniel 
Landau (Salk Institute, San Diego, CA). The format of this assay was essentially the 

10 same as the MT-2 assay as described elsewhere (Montefiori, et al. J. Clin Microbiol. 
26:231-235 (1988)) except that virus infection was quantified by luciferase reporter 
gene expression using a commercial luciferase kit (Promega). All serum samples were 
heat-inactivated for 1 hour at 56°C prior to assay. The virus stocks of the HIV-l 
isolates were generated in PBMC. 

15 Table 1 1 presents the neutralizing antibody data from these studies in 

chimpanzees. 



Table 11 



Group/Animal 


Vector/dose 


Bleed 
day 


HIV-1 MN 1 


HIV-1 
SF162 2 


1-1 

4x0271 (SW) 


delE3, 10 7 


0 


<20 


<20 


1-1 




105 


<20 


<20 
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Group/Animal 


Vector/dose 


Bleed 
day 


HIV-1 MN 1 


HIV-1 
SF162 


1-1 




273 


48 


40 


1-2 

4x0363 (SW) 


delE3,10 7 


0 


<20 

: 


<20 

— - 


1-2 






105 


25 


92 


1-2 




273 


1,296 


5,877 


2-1 

4xU3»0 (SW) 


delE3, 10 8 


0 


<20 


<20 


2-1 




105 


<20 


20 


2-1 




273 


228 


133 


2-2 
1 82D (NI) 


delE3, 10 8 


0 


<20 


<20 




2-2 




105 


47 


97 


2-2 




273 


5,801 


3,437 


3-1 

4xUJ /o (o W) 


delEl,E3, 
10 


0 


<20 


<20 


3-1 




— 105 — 


<20 


<20 


3-1 




273 


<20 


<20 


4-1 

4x03 / 3 (aw) 


delEl,E3, 

ID 


0 


<20 


<20 




4-1 




105 


34 


<20 


4-1 




273 


72 


119 


4-2 

87A003 (NI) 


delEl,E3, 
10 


0 


<20 


<20 


4-2 




— 105 — 


<20 




4-2 




273 


<20 


<20 


4-3 
A136(NI) 


delEl,E3, 
10 9 


0 


<20 


<20 


4-3 




105 


<20 


<20 


4-3 




273 


<20 


21 



1 - determined by MT-2 assay described above. Neutralizing antibody titers 
are reported as reciprocal serum dilution at which 50% cell killing was measured in 
test wells as compared to virus control wells. 
5 2 - determined by M71uc assay described above. Neutralizing antibody titers 

are reported as reciprocal serum dilution at which 50% luciferase activity was 
measured in test wells as compared to virus control wells. 

The results in Table 1 1 support the use of the combination approaches 
described herein to induce potent and broad HIV-neutralization activity. For example, 
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on bleed day 273 sera obtain from all animals in Groups 1-3 comprised neutralizing 
antibodies against both the subtype B strain from which envelope protein coding 
sequences were derived (HIV-1 MN) for polynucleotide immunization and the 
subtype B strain from which envelope protein coding sequences were derived (HIV-1 
5 SF162) for polypeptide immunization. One chimp in group 4 showed neutralizing 
antibodies, which correlated with better Ab response for that animal as seen in Table 
10. 

Overall, the replication competent recombinant Adeno vectors (delE3) 
provided a stronger priming of B cell responses than did the replication incompetent 

10 Adeno constructs (del El, E3) with higher Env-specific binding antibody titers as 

measured by ELISA and higher serum neutralizing antibody responses against the MN 
and SF162 virus strains. These data demonstrate that a subject can be immunized with 
an envelope protein from a first HIV strain of a given subtype, be boosted with an 
envelope protein from a second HIV strain of the same subtype and generate 

15 neutralizing antibodies against both HIV strains. The data presented in Example 4 in 
combination with the data presented in Example 5 together demonstrate that such HIV 
strains may be within subtype, or from different subtypes. 

Further assays may be used to evaluate the immune responses of the 
immunized chimpanzees, including, but not limited to, the following: 

20 A. ELISPOT for Ad5 fiber, Ad7 fiber, HIV env, and HlVrev overlapping 
peptides. 

Assay methods are essentially as described in Zhao, et al., J.Virol. 77:8354- 
8365 (2003). Peptides for use in this assay are derived from HTV-1MN Env and Rev 
or Ad fiber protein. 

25 

B. CTL assays by CR-release. 

This standard CTL assay was carried out essentially as described by Lubeck, et 
al., Nature Med. 3:651-8 (1997), and Buge, et al., J. Virol. 71:8531-8541 (1997). 

30 C. Proliferative T-cell responses against HIV gp 1 20; aldrithiol-2 inactivated 
gpl20; Ad proteins. 
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These assays were carried out essentially as described in Buge, et al., J. Virol. 
71:8531-8541 (1997). The data is shown in Figure 21. 



D. Ad5 and Ad7 microtiter neutralization assays. 

5 These assays are carried out essentially as described in Buge, et al., J. Virol. 

71:8531-8541 (1997). 

E. Ad shedding in nasal and stool samples by PCR. 

These assays are carried out essentially as described in Buge, et al., J. Virol. 
10 71:8531-8541 (1997). 



The data in this example support that the combination methods of the present 
invention can be used to broadly raise neutralizing antibodies against multiple viral 
strains of the same subtype. 

15 

Although preferred embodiments of the subject invention have been described 
in some detail, it is understood that obvious variations can be made without departing 
from the spirit and the scope of the invention. The following embodiments are offered 
for illustrative purposes only, and are not intended to limit the scope of the present 
20 invention in any way. 
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Exemplary Embodiments of the Present Invention: 

1 . A composition for generating an immune response in a mammal, said 
composition comprising, 

5 a polynucleotide component consisting essentially of one polynucleotide 

encoding an HIV immunogenic polypeptide derived from a first HIV strain of a first 
subtype, and 

a polypeptide component comprising one or more HIV immunogenic 
polypeptides analogous to the polypeptide encoded by said polynucleotide component, 
10 with the proviso that at least one HIV immunogenic polypeptide of the polypeptide 
component is derived from a second HIV strain of the first subtype, wherein said first 
HIV strain and said second HIV strain are different. 

2. A composition for generating an immune response in a mammal, said 
15 composition comprising, 

a polynucleotide component comprising two or more polynucleotide sequences 
comprising coding sequences for two or more analogous HIV immunogenic 
polypeptides derived from a first HIV subtype, wherein the coding sequences for at 
least two of the HIV immunogenic polypeptides are derived from different HIV strains 

20 of the first subtype, and 

a polypeptide component comprising one or more HIV immunogenic 
polypeptides analogous to the polypeptide encoded by said polynucleotide component, 
with the proviso that, if the polypeptide component comprises the same number or 
greater than the number of analogous HIV immunogenic polypeptides encoded by the 

25 polynucleotide component, then at least one of the HIV immunogenic polypeptides of 
the polypeptide composition is derived from a different HIV strain of the first subtype 
than the HIV immunogenic polypeptides provided by the polynucleotide component. 

3. A composition for generating an immune response in a mammal, said 
30 composition comprising, 
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a polynucleotide component consisting essentially of one polynucleotide 
encoding an HIV immunogenic polypeptide derived from a first HIV strain of a first 
subtype, and 

a polypeptide component comprising one or more HIV immunogenic 
polypeptides analogous to the polypeptide encoded by said polynucleotide component, 
with the proviso that at least one HIV immunogenic polypeptide of the polypeptide 
component is derived from a second HIV strain of the first subtype, wherein said first 
HIV strain and said second HIV strain are different; 

with the provisos that (i) the polynucleotide component does not encode an 
analogous HIV immunogenic polypeptide derived from any subtype other than the 
first subtype, and (ii) the polypeptide component does not comprise an analogous HIV 
immunogenic polypeptide derived from any subtype other than the first subtype. 

4. A composition for generating an immune response in a mammal, said 
composition comprising, 

a polynucleotide component comprising two or more polynucleotide sequences 
comprising coding sequences for two or more analogous HIV immunogenic 
polypeptides derived from a first HIV subtype, wherein the coding sequences for at 
least two of the HIV immunogenic polypeptides are derived from different HIV strains 
of the first subtype, and 

a polypeptide component comprising one or more HIV immunogenic 
polypeptides analogous to the polypeptide encoded by said polynucleotide component, 
with the proviso that, if the polypeptide component comprises the same number or 
greater than the number of analogous HIV immunogenic polypeptides encoded by the 
polynucleotide component, then at least one of the HIV immunogenic polypeptides of 
the polypeptide composition is derived from a different HIV strain of the first subtype 
than the HIV immunogenic polypeptides provided by the polynucleotide component; 

with the further provisos that (i) the polynucleotide component does not 
encode an analogous HIV immunogenic polypeptide derived from any subtype other 
than the first subtype, and (ii) the polypeptide component does not comprise an 
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analogous HIV immunogenic polypeptide derived from any subtype other than the 
first subtype. 

5. The composition of any of embodiments 1 to 4, wherein said 

5 polynucleotide component comprises at least one polynucleotide that is a native 
polynucleotide. 

6. The composition of any of embodiments 1 to 4, wherein said 
polynucleotide component comprises at least one polynucleotide that is a synthetic 

10 polynucleotide. 

7. The composition of embodiment 6, wherein said synthetic polynucleotide 
comprises codons optimized for expression in mammalian cells. 

1 5 8. The composition of embodiment 7, wherein said synthetic polynucleotide 

comprises codons optimized for expression in human cells. 

9. The composition of any of embodiments 1 to 4, wherein the HIV 
immunogenic polypeptides are HIV envelope polypeptides. 

20 

10. The composition of embodiment 9, wherein at least one of said HIV 
polypeptides comprises one or more mutations. 

11. The composition of embodiment 10, wherein at least one of said envelope 
25 polypeptides comprises a mutation in the cleavage site or a mutation in the 

glycosylation site. 

12. The composition of embodiment 10, wherein at least one of said envelope 
polypeptides comprises a deletion or modification of the VI region. 

30 
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13. The composition of embodiment 10, wherein at least one of said envelope 
polypeptides comprises a deletion or modification of the V2 region. 

14. The composition of embodiment 10, wherein at least one of said envelope 
5 polypeptides comprises a deletion or modification of the V3 region. 

15. The composition of embodiment 10, wherein at least one of said envelope 
polypeptides comprises a deletion or modification of regions selected from the group 
consisting of the VI region, the V2 region, the V3 region, and combinations thereof. 

10 

16. The composition of embodiment 10, wherein at least one of said envelope 
polypeptides comprises envelope polypeptide modified to expose an envelope binding 
region that binds to a CCR5 chemokine co-receptor. 

15 17. The composition of any of embodiments 1 to 4, wherein at least one 

polynucleotide encoding an HIV immunogenic polypeptide encodes an immunogenic 
HIV polypeptide selected from the group consisting of: Gag, Env, Pol, Prot, Int, RT, 
vif, vpr, vpu, tat, rev, and nef. 

20 18. The composition of any of embodiments 1 to 4, wherein the first HIV 

subtype is selected from the group consisting of: subtype A, subtype B, subtype C, 
subtype D, subtype E, subtype F, subtype G, subtype H, subtype I, subtype J, subtype 
K, subtype N and subtype O. 

25 19. The composition of any of embodiments 1 to 4, wherein at least one of 

said immunogenic HIV polypeptides comprises one or more mutations. 

20. The composition of any of embodiments 1 to 4, wherein said 
polynucleotide component further comprises a sequence encoding an additional 
30 antigenic polypeptide, with the proviso that the additional antigenic polypeptide is not 
an immunogenic polypeptide derived from an HIV-1 strain. 
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21. The composition of embodiment 20, wherein said polypeptide component 
further comprises a polypeptide having an additional antigenic peptide, with the 
proviso that the additional antigenic polypeptide is not an immunogenic polypeptide 
5 derived from an HIV-1 strain. 



22. The composition of any of embodiments 1 to 4, wherein said 
polynucleotide component further comprises sequences encoding one or more control 
elements compatible with expression in a selected host cell, wherein said control 

10 elements are operable linked to polynucleotides encoding HIV immunogenic 
polypeptides. 

23. The composition of embodiment 22, wherein said control elements are 
selected from the group consisting of a transcription promoter, a transcription 

15 enhancer element, a transcription termination signal, polyadenylation sequences, 

sequences for optimization of initiation of translation, an internal ribosome entry site, 
and translation termination sequences. 



24. The composition of embodiment 23, wherein said transcription promoter 
20 is selected from the group consisting of CMV, CMV+intron A, SV40, RSV, HIV-Ltr, 
MMLV-ltr, and metallothionein. 



25. A method of generating an immune response in a subject, comprising, 
providing the composition for generating an immune response in a mammal of 
25 embodiments 1,2 3, or 4; 

administering one or more gene delivery vectors comprising the 
polynucleotides of said polynucleotide component of the composition into said subject 
under conditions that are compatible with expression of said polynucleotides in said 
subject for the production of encoded HIV immunogenic polypeptides; and 
30 administering the polypeptide component to said subject. 
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26. The method of embodiment 25, wherein said one or more gene delivery 
vectors and said polypeptide component are administered concurrently. 

27. The method of embodiment 26, wherein said one or more gene delivery 
5 vectors and said polypeptide component are administered sequentially. 

28. The method of embodiment 25, wherein said polypeptide component 
further comprises an adjuvant. 

10 29. The method of embodiment 25, wherein said polynucleotide component 

further comprises a carrier. 

30. The method of embodiment 25, wherein said one or more gene delivery 
vectors are nonviral vectors. 

15 

3 1 . The method of embodiment 25, wherein said one or more gene delivery 
vectors are delivered using a particulate carrier. 

32. The method of embodiment 3 1 , wherein said one or more gene 
20 delivery vectors are coated on a gold or tungsten particle and said 

coated particle is delivered to said subject using a gene gun. 

33. The method of embodiment 3 1 , wherein said one or more gene delivery 
vectors are delivered using a PLG particle. 

25 

34. The method of embodiment 25, wherein said one or more gene delivery 
vectors are encapsulated in a liposome preparation. 

35. The method of embodiment 25, wherein said one or more gene delivery 
30 vectors are viral vectors. 
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36. The method of embodiment 35, wherein said viral vectors are retroviral 

vectors. 

37. The method of embodiment 35, wherein said viral vector are lentiviral 

5 vectors. 

38. The method of embodiment 35, wherein said viral vectors are alphaviral 

vectors. 

10 39. The method of embodiment 25, wherein said subject is a mammal. 

40. The method of embodiment 39, wherein said mammal is a human. 

41. The method of embodiment 25, wherein said immune response is a 
15 humoral immune response. 

42. The method of embodiment 25, wherein said immune response is a 
cellular immune response. 

20 43. The method of embodiment 25, wherein said one or more gene delivery 

vectors are administered intramuscularly, intramucosally, intranasally, 
subcutaneously, intradermal ly, transdermal ly, intravaginally, intrarectally, orally or 
intravenously. 

25 44. The method of embodiment 25, wherein said immune response results in 

generating neutralizing antibodies in the subject against multiple strains derived from 
the first HIV subtype. 

30 
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COMBINATION APPROACHES FOR GENERATING IMMUNE 
RESPONSES AGAINST MULTIPLE VIRAL STRAINS SELECTED FROM A 
GIVEN SUBTYPE OR SEROTYPE 

5 Abstract of the Disclosure 

The present invention relates to methods, polynucleotides, and polypeptides 
encoding immunogenic HIV polypeptides derived from different strains within an 
HIV subtype. Uses of the polynucleotides and polypeptides in combination 
10 approaches for generating immune responses are described. The combination 

approaches described herein have been shown to induce broad and potent neutralizing 
activity against diverse HIV strains from multiple strains within a given subtype. 
Formulations of compositions for generating immune responses and methods of use 
for such compositions are also disclosed. 
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1 TGGAAGGGTT AATTTACTCC AAGAAAAGGC AAGAAATCCT TGATTTGTGG GTCTATCACA 
.61 CACAAGGCTT CTTCCCTGAT TGGCAAAACT ACACACCGGG GCCAGGGGTC AGATATCCAC 
121 TGACCTTTGG ATGGTGCTAC AAGCTAGTGC CAGTTGACCC AGGGGAGGTG GAAGAGGCCA 
181 ACGGAGGAGA AGACAACTGT TTGCTACACC CTATGAGCCA ACATGGAGCA GAGGATGAAG 
241 ATAGAGAAGT ATTAAAGTGG AAGTTTGACA GCCTCCTAGC ACGCAGACAC ATGGCCCGCG 
301 AGCTACATCC GGAGTATTAC AAAGACTGCT GACACAGAAG GGACTTTCCG CCTGGGACTT 
361 TCCACTGGGG CGTTCCGGGA GGTGTGGTCT GGGCGGGACT TGGGAGTGGT CAACCCTCAG 
421 ATGCTGCATA TAAGCAGCTG CTTTTCGCCT GTACTGGGTC TCTCTCGGTA GACCAGATCT 
481 GAGCCTGGGA GCCCTCTGGC TATCTAGGGA ACCCACTGCT TAAGCCTCAA TAAAGCTTGC 
541 CTTGAGTGCT TTAAGTAGTG TGTGCCCATC TGTTGTGTGA CTCTGGTAAC TAGAGATCCC 
601 TCAGACCCTT TGTGGTAGTG TGGAAAATCT CTAGCAGTGG CGCCCGAACA GGGACCAGAA 
661 AGTGAAAGTG AGACCAGAGG AGATCTCTCG ACGCAGGACT CGGCTTGCTG AAGTGCACAC 
721 GGCAAGAGGC GAGAGGGGCG GCTGGTGAGT ACGCCAATTT TACTTGACTA GCGGAGGCTA 
781 GAAGGAGAGA GATGGGTGCG AGAGCGTCAA TATTAAGCGG CGGAAAATTA GATAAATGGG 
841 AAAGAATTAG GTTAAGGCCA GGGGGAAAGA AACATTATAT GTTAAAACAT CTAGTATGGG 
901 CAAGCAGGGA GCTGGAAAGA TTTGCACTTA ACCCTGGCCT GTTAGAAACA TCAGAAGGCT 
961 GTAAACAAAT AATAAAACAG CTACAACCAG CTCTTCAGAC AGGAACAGAG GAACTTAGAT 
1021 CATTATTCAA CACAGTAGCA ACTCTCTATT GTGTACATAA AGGGATAGAG GTACGAGACA 
1081 CCAAGGAAGC CTTAGACAAG ATAGAGGAAG AACAAAACAA ATGTCAGCAA AAAGCACAAC 
1141 AGGCAAAAGC AGCTGACGAA AAGGTCAGTC AAAATTATCC TATAGTACAG AATGCCCAAG 
1201 GGCAAATGGT ACACCAAGCT ATATCACCTA GAACATTGAA TGCATGGATA AAAGTAATAG 
12 61 AGGAAAAGGC TTTCAATCCA GAGGAAATAC CCATGTTTAC AGCATTATCA GAAGGAGCCA 
1321 CCCCACAAGA TTTAAACACA ATGTTAAATA CAGTGGGGGG ACATCAAGCA GCCATGCAAA 
1381 TGTTAAAAGA TACCATCAAT GAGGAGGCTG CAGAATGGGA TAGGACACAT CCAGTACATG 
1441 CAGGGCCTGT TGCACCAGGC CAGATGAGAG AACCAAGGGG AAGTGACATA GCAGGAACTA 
1501 CTAGTACCCT TCAGGAACAA ATAGCATGGA TGACAAGTAA TCCACCTATT CCAGTAGAAG 
1561 ACATCTATAA AAGATGGATA ATTCTGGGGT TAAATAAAAT AGTAAGAATG TATAGCCCTG 
1621 TTAGCATTTT GGACATAAAA CAAGGGCCAA AAGAACCCTT TAGAGACTAT GTAGACCGGT 
1681 TCTTTAAAAC CTTAAGAGCT GAACAAGCTA CACAAGATGT AAAGAATTGG ATGACAGACA 
1741 CCTTGTTGGT CCAAAATGCG AACCCAGATT GTAAGACCAT TTTAAGAGCA TTAGGACCAG 
1801 GGGCCTCATT AGAAGAAATG ATGACAGCAT GTCAGGGAGT GGGAGGACCT AGCCATAAAG 
1861 CAAGAGTGTT GGCTGAGGCA ATGAGCCAAG CAAACAGTAA CATACTAGTG CAGAGAAGCA 
1921 ATTTTAAAGG CTCTAACAGA AT TAT T AAAT GTTTCAACTG TGGCAAAGTA GGGCACATAG 
1981 CCAGAAATTG CAGGGCCCCT AGGAAAAAGG GCTGTTGGAA ATGTGGACAG GAAGGACACC 
2041 AAATGAAAGA CTGTACTGAG AGGCAGGCTA ATTTTTTAGG GAAAATTTGG CCTTCCCACA 
2101 AGGGGAGGCC AGGGAATTTC CTCCAGAACA GACCAGAGCC AACAGCCCCA CCAGCAGAAC 
2161 CAACAGCCCC ACCAGCAGAG AGCTTCAGGT TCGAGGAGAC AACCCCCGTG CCGAGGAAGG 
2221 AGAAAGAGAG GGAACCTTTA ACTTCCCTCA AATCACTCTT TGGCAGCGAC CCCTTGTCTC 
2281 AATAAAAGTA GAGGGCCAGA TAAAGGAGGC TCTCTTAGAC ACAGGAGCAG ATGATACAGT 
2341 ATTAGAAGAA ATAGATTTGC CAGGGAAATG GAAACCAAAA ATGATAGGGG GAATTGGAGG 
2401 TTTTATCAAA GTAAGACAGT ATGATCAAAT ACTTATAGAA ATTTGTGGAA AAAAGGCTAT 
24 61 AGGTACAGTA TTAGTAGGGC CTACACCAGT CAACATAATT GGAAGAAATC TGTTAACTCA 
2521 GCTTGGATGC ACACTAAATT TTCCAATTAG TCCTATTGAA ACTGTACCAG TAAAATTAAA 
2581 ACCAGGAATG GATGGCCCAA AGGTCAAACA ATGGCCATTG ACAGAAGAAA AAATAAAAGC 
2641 ATTAACAGCA ATTTGTGAGG AAATGGAGAA GGAAGGAAAA ATTACAAAAA TTGGGCCTGA 
27 01 TAATCCATAT AACACTCCAG TATTTGCCAT AAAAAAGAAG GACAGTACTA AGTGGAGAAA 
27 61 ATTAGTAGAT TTCAGGGAAC TCAATAAAAG AACTCAAGAC TTTTGGGAAG TTCAATTAGG 
2821 AATACCACAC CCAGCAGGAT TAAAAAAGAA AAAATCAGTG ACAGTGCTAG ATGTGGGGGA 
2881 TGCATATTTT TCAGTTCCTT TAGATGAAAG CTTCAGGAAA TATACTGCAT TCACCATACC 
2941 TAGTATAAAC AATGAAACAC CAGGGATTAG ATATCAATAT AATGTGCTGC CACAGGGATG 
3001 GAAAGGATCA CCAGCAATAT TCCAGAGTAG CATGACAAAA ATCTTAGAGC CCTTCAGAGC 
3061 AAAAAATCCA GACATAGTTA TCTATCAATA TATGGATGAC TTGTATGTAG GATCTGACTT 
3121 AGAAATAGGG CAACATAGAG CAAAAATAGA AGAGTTAAGG GAACATTTAT TGAAATGGGG 



FIGURE IB 

3181 ATTTACAACA CCAGACAAGA AACATCAAAA AGAACCCCCA TTTCTTTGGA TGGGGTATGA 
3241 ACTCCATCCT GACAAATGGA CAGTACAACC TATACTGCTG CCAGAAAAGG ATAGTTGGAC 
3301 TGTCAATGAT ATACAGAAGT TAGTGGGAAA ATTAAACTGG GCAAGTCAGA TTTACCCAGG 
3361 GATTAAAGTA AGGCAACTCT GTAAACTCCT CAGGGGGGCC AAAGCACTAA CAGACATAGT 
34 21 ACCACTAACT GAAGAAGCAG AATTAGAATT GGCAGAGAAC AGGGAAATTT TAAGAGAACC 
34 81 AGTACATGGA GTATATTATG ATCCATCAAA AGACTTGATA GCTGAAATAC AGAAACAGGG 
3541 GCATGAACAA TGGACATATC AAATTTATCA AGAACCATTT AAAAATCTGA AAACAGGGAA 
3601 GTATGCAAAA ATGAGGACTA CCCACACTAA TGATGTAAAA CAGTTAACAG AGGCAGTGCA 
3661 AAAAATAGCC ATGGAAAGCA TAGTAATATG GGGAAAGACT CCTAAATTTA GACTACCCAT 
3721 CCAAAAAGAA ACATGGGAGA CATGGTGGAC AGACTATTGG CAAGCCACCT GGATCCCTGA 
3781 GTGGGAGTTT GTTAATACCC CTCCCCTAGT AAAATTATGG TACCAACTAG AAAAAGATCC 
3841 CATAGCAGGA GTAGAAACTT TCTATGTAGA TGGAGCAACT AATAGGGAAG CTAAAATAGG 
3901 AAAAGCAGGG TATGTTACTG ACAGAGGAAG GCAGAAAATT GTTACTCTAA CTAACACAAC 
3961 AAATCAGAAG ACTGAGTTAC AAGCAATTCA GCTAGCTCTG CAGGATTCAG GATCAGAAGT 
4 021 AAACATAGTA ACAGACTCAC AGTATGCATT AGGAATCATT CAAGCACAAC CAGATAAGAG 
4081 TGACTCAGAG ATATTTAACC AAATAATAGA ACAGTTAATA AACAAGGAAA GAATCTACCT 
4141 GTCATGGGTA CCAGCACATA AAGGAATTGG GGGAAATGAA CAAGTAGATA AATTAGTAAG 
4201 TAAGGGAATT AGGAAAGTGT TGTTTCTAGA TGGAATAGAT AAAGCTCAAG AAGAGCATGA 
4261 AAGGTACCAC AGCAATTGGA GAGCAATGGC TAATGAGTTT AATCTGCCAC CCATAGTAGC 
4321 AAAAGAAATA GTAGCTAGCT GTGATAAATG TCAGCTAAAA GGGGAAGCCA TACATGGACA 
4 381 AGTCGACTGT AGTCCAGGGA TATGGCAATT AGATTGTACC CATTTAGAGG GAAAAATCAT 
4 441 CCTGGTAGCA GTCCATGTAG CTAGTGGCTA CATGGAAGCA GAGGTTATCC CAGCAGAAAC 
4 501 AGGACAAGAA ACAGCATATT TTATATTAAA ATTAGCAGGA AGATGGCCAG TCAAAGTAAT 
4561 AC AT AC AG AC AATGGCAGTA ATTTTACCAG TACTGCAGTT AAGGCAGCCT GTTGGTGGGC 
4 621 AGGTATCCAA CAGGAATTTG GAATTCCCTA CAATCCCCAA AGTCAGGGAG TGGTAGAATC 
4 681 CATGAATAAA GAATTAAAGA AAATAATAGG ACAAGTAAGA GATCAAGCTG AGCACCTTAA 
4741 GACAGCAGTA CAAATGGCAG TATTCATTCA CAATTTTAAA AGAAAAGGGG GAATTGGGGG 
4801 GTACAGTGCA GGGGAAAGAA TAATAGACAT AATAGCAACA GACATACAAA CTAAAGAATT 
4861 ACAAAAACAA ATTATAAGAA TTCAAAATTT TCGGGTTTAT TACAGAGACA GCAGAGACCC 

4 921 TATTTGGAAA GGACCAGCCG AACTACTCTG GAAAGGTGAA GGGGTAGTAG TAATAGAAGA 
4981 TAAAGGTGAC ATAAAGGTAG TACCAAGGAG GAAAGCAAAA ATCATTAGAG ATTATGGAAA 
5041 ACAGATGGCA GGTGCTGATT GTGTGGCAGG TGGACAGGAT GAAGATTAGA GCATGGAATA 
5101 GTTTAGTAAA GCACCATATG TATATATCAA GGAGAGCTAG TGGATGGGTC TACAGACATC 
5161 ATTTTGAAAG CAGACATCCA AAAGTAAGTT CAGAAGTACA TATCCCATTA GGGGATGCTA 
5221 GATTAGTAAT AAAAACATAT TGGGGTTTGC AGACAGGAGA AAGAGATTGG CATTTGGGTC 
5281 ATGGAGTCTC CATAGAATGG AGACTGAGAG AATACAGCAC ACAAGTAGAC CCTGACCTGG 
5341 CAGACCAGCT AATTCACATG CATTATTTTG ATTGTTTTAC AGAATCTGCC ATAAGACAAG 
5401 CCATATTAGG ACACATAGTT TTTCCTAGGT GTGACTATCA AGCAGGACAT AAGAAGGTAG 
54 61 GATCTCTGCA ATACTTGGCA CTGACAGCAT TGATAAAACC AAAAAAGAGA AAGCCACCTC 
5521 TGCCTAGTGT TAGAAAATTA GTAGAGGATA GATGGAACGA CCCCCAGAAG ACCAGGGGCC 
5581 GCAGAGGGAA CCATACAATG AATGGACACT AGAGATTCTA GAAGAACTCA AGCAGGAAGC 

5 641 TGTCAGACAC TTTCCTAGAC CATGGCTCCA TAGCTTAGGA CAATATATCT ATGAAACCTA 
5701 TGGGGATACT TGGACGGGAG TTGAAGCTAT AATAAGAGTA CTGCAACAAC TACTGTTCAT 
57 61 TCATTTCAGA ATTGGATGCC AACATAGCAG AATAGGCATC TTGCGACAGA GAAGAGCAAG 
5821 AAATGGAGCC AGTAGATCCT AAACTAAAGC CCTGGAACCA TCCAGGAAGC CAACCTAAAA 
5881 CAGCTTGTAA TAATTGCTTT TGCAAACACT GTAGCTATCA TTGTCTAGTT TGCTTTCAGA 
5941 CAAAAGGTTT AGGCATTTCC TATGGCAGGA AGAAGCGGAG ACAGCGACGA AGCGCTCCTC 
6001 CAAGTGGTGA AGATCATCAA AATCCTCTAT CAAAGCAGTA AGTACACATA GTAGATGTAA 
6061 TGGTAAGTTT AAGTTTATTT AAAGGAGTAG ATTATAGATT AGGAGTAGGA GCATTGATAG 
6121 TAGCACTAAT CATAGCAATA ATAGTGTGGA CCATAGCATA TATAGAATAT AGGAAATTGG 
6181 TAAGACAAAA GAAAATAGAC TGGTTAATTA AAAGAATTAG GGAAAGAGCA GAAGACAGTG 
6241 GCAATGAGAG TGATGGGGAC ACAGAAGAAT TGTCAACAAT GGTGGATATG GGGCATCTTA 
6301 GGCTTCTGGA TGCTAATGAT TTGTAACACG GAGGACTTGT GGGTCACAGT CTACTATGGG 



6361 GTACCTGTGT GGAGAGAAGC 

6421 GAGACAGAAG TGCATAATGT 

6481 CAAGAAATAG TTTTGGGAAA 

6541 GATCAGATGC ATGAGGATAT 

6601 TTGACCCCAC TCTGTGTCAC 

6661 GTTACAGGTA ATACAAATGA 

6721 AAAAATTGCT CTTTCAATGC 

6781 CTCTTTTATA AACTTGATAT 

6841 TTAATAAATT GCAATACCTC 

6901 ATTCCTATAC ATTACTGTGC 

6961 TTCAATGGGA CAGGACCATG 

7021 CCAGTGGTAT CAACTCAACT 

7081 AGATCTGAAA ATTTGACAGA 

7141 GAGATTAATT GTACAAGGCC 

7201 CAAGCATTCT ATGCAACAAA 

7261 AGTACAGATA GATGGAATAA 

7321 CCTAATAAAA CAATAAAATT 

7381 AGCTTTAATT GTAGAGGAGA 

7441 TACTACCCTA AGAATGGTAC 

7501 CAATGCAAAA TAAAACAAAT 

7561 CCTCCCATTG CAGGAAACAT 

7621 GATGGGGGAT TTAACAACAC 

7681 GATATGAGGG ATAACTGGAG 

77 41 TTGGGAATAG CACCCACTAA 

7801 GGAATAGGAG CTGTGTTCCT 

7861 TCAATAACGC TGACGGTACA 

7921 AATTTGCTGA AGGCTATAGA 

7 981 AAGCAGCTCC AGGCGAGAGT 

8041 GGGATTTGGG GCTGCTCTGG 

8101 TGGAGTAATA AATCTGAAGC 

8161 GAAATTAATA ATTACACAGA 

8221 GAAAAGAATG AAAAAGATTT 

8281 GACATATCAA ACTGGCTGTG 

8341 GGTTTAAGAA TAATTTTTGC 

84 01 CCTTTGTCAT TTCAGACCCT 

84 61 GAAGAAGAAG GTGGAGAGCA 

8521 TCGCTTGCCT GGGACGATCT 

8581 TTCATATTAA TTGCAGTGAG 

8641 CAGAGGGGGT GGGAGATCCT 

8701 CTAAAAAAGA GTGCTATTAG 

87 61 GATAGGATTA TAGAATTGGT 

8821 ATAAGACAGG GCTTTGAAGC 

8881 CATAGTTGGA TGGCCTGCAG 

8941 AGTAGGAGCA GCGTCTCAAG 

9001 TGCTACTAAT GAAGCTTGTG 

9061 TCCAGTCAGA CCTCAGGTAC 

9121 CTTCTTTTTA AAAGAAAAGG 

9181 AATCCTTGAT TTGTGGGTCT 

9241 ATCGGGGCCA GGGGTCCGAT 

9301 TGACCCAAGG GAGGTGAAAG 

9361 GAGCCAACAT GGAGCAGAGG 

94 21 TCTAGCACAC AGACACATGG 



FIGURE 1C 

AAAAACTACT CTATTCTGTG 
CTGGGCTACA CATGCTTGTG 
TGTAACAGAA AATTTTAATA 
AATCAGTTTA TGGGATCAAA 
TTTAAACTGT ACAGATACAA 
TACCAATATT GCAAATGCTA 
AACCACAGAA TTAAGAGATA 
AGTACCACTT AATGAAAATA 
AACCATAACA CAAGCCTGTC 
TCCAGCTGAT TATGCGATTC 
TTATAATGTC AGCACAGTAC 
ACTGTTAAAT GGTAGTCTAG 
GAATACCAAA ACAATAATAG 
CAACAATAAT ACAAGGAAAA 
TGACGTAATA GGAAACATAA 
AACTTTACAA CAGGTAATGA 
TGAACCAGAT GCAGGAGGGG 
ATTTTTCTAT TGCAATACAT 
ATACAAATAC AATGGTAATT 
TGTACGCATG TGGCAAGGGG 
AACATGTAGA TCAAACATCA 
AAACAACGAC ACAGAGGAGA 
AAGTGAATTA TATAAATATA 
GGCAAAAAGA AGAGTGGTGC 
TGGGTTCTTG GGAGCAGCAG 
GGCCAGACAA CTGTTGTCTG 
GGCGCAACAG CATATGTTGC 
CCTGGCTATA GAAAGATACC 
AAGACTCATC TGCACCACTG 
AGATATTTGG GATAACATGA 
AACAATATTC AGGTTGCTTG 
ATTAGAATTG GACAAGTGGA 
GTATATAAAA ATATTCATAA 
TGTGCTCTCT ATAGTGAATA 
TACCCCAAGC CCGAGGGGAC 
AGACAGAGAC AGATCCATAC 
GCGGAGCCTG TGCCTCTTCA 
GGCAGTGGAA CTTCTGGGAC 
TAAGTATCTG GGAAGTCTTG 
TCCGCTTGAT ACCATAGCAA 
ACAAAGAATT TGTAGAGCTA 
AGCTTTGCTA TAAAATGGGA 
TAAGAGAAAG AATGAGAAGA 
ACTTAGATAG ACATGGGGCA 
CCTGGCTGCA AGCACAAGAG 
CTTTAAGACC AATGACTTAT 
GGGGACTGGA AGGGTTAATT 
ATAACACACA AGGCTTCTTC 
TCCCACTGAC CTTTGGATGG 
AGGCCAATGA AGGAGAAGAC 
ATGAAGATAG AGAAGTATTA 
CCCGCGAGCT ACATCCGGAG 



CATCAGATGC TAAAGCATAT 
TACCCACAGA CCCCAACCCA 
TGTGGAAAAA TAACATGGCA 
GCCTAAAGCC ATGTGTAAAG 
ATGTTACAGG TAATAGAACT 
CATATAAGTA TGAAGAAATG 
AGAAACATAA AGAGTATGCA 
GTAACAACTT TACATATAGA 
CAAAGGTCTC TTTTGACCCG 
TAAAGTGTAA TAATAAGACA 
AATGTACACA TGGAATTAAG 
CAGAAGAAGG GATAATAATT 
TACATCTTAA TGAATCTGTA 
GTGTAAGGAT AGGACCAGGA 
GACAAGCACA TTGTAACATT 
AAAAATTAGG AGAGCATTTC 
ATCTAGAAAT TACAATGCAT 
CAAACCTGTT TAATAGTACA 
CAAGCTTACC CATCACACTC 
TAGGACAAGC AATGTATGCC 
CAGGAATACT ATTGACACGT 
CATTCAGACC TGGAGGAGGA 
AAGTGGTAGA AATTAAGCCA 
AGAGAAAAAA AAGAGCAGTG 
GAAGCACTAT GGGCGCAGCG 
GTATAGTGCA ACAGCAAAGC 
AACTCACAGT CTGGGGCATT 
TAAAGGATCA ACAGCTCCTA 
CTGTGCCTTG GAACTCCAGT 
CTTGGATGCA GTGGGATAGA 
AAGACTCGCA AAACCAGCAG 
ATAATCTGTG GAATTGGTTT 
TGATAGTAGG AGGCTTGATA 
GAGTTAGGCA GGGATACTCA 
TCGACAGGCT CGGAGGAATC 
GATTGGTGAG CGGATTCTTG 
GCTACCACCG CTTGAGAGAC 
ACAGCAGTCT CAGGGGACTA 
TGCAGTATTG GGGTCTAGAG 
TAGCAGTAGC TGAAGGAACA 
TCCTCAACAT ACCTAGGAGA 
GGCAAGTGGT CAAAACGCAG 
ACTGAGCCAG CAGCAGAGGG 
CTTACAAGCA GCAACACACC 
GAGGACGGAG ATGTAGGCTT 
AAGAGTGCAG TAGATCTCAG 
TACTCTAGGA AAAGGCAAGA 
CCTGATTGGC AAAACTACAC 
TGCTTCAAGC TAGTACCAGT 
AACTGTTTGC TACACCCTAT 
AAGTGGAAGT TTGACAGCCT 
TATTACAAAG ACTGCTGACA 



FIGURE ID 

9481 CAGAAGGGAC TTTCCGCCTG GGACTTTCCA CTGGGGCGTT CCGGGAGGTG TGGTCTGGGC 

9541 GGGACTTGGG AGTGGTCACC CTCAGATGCT GCATATAAGC AGCTGCTTTT CGCTTGTACT 

9601 GGGTCTCTCT CGGTAGACCA GATCTGAGCC TGGGAGCTCT CTGGCTATCT AGGGAACCCA 

9661 CTGCTTAGGC CTCAATAAAG CTTGCCTTGA GTGCTCTAAG TAGTGTGTGC CCATCTGTTG 

9721 TGTGACTCTG GTAACTAGAG ATCCCTCAGA CCCTTTGTGG TAGTGTGGAA AATCTCTAGC 

9781 A 



Figure 2A 

I : indicates the regions for p-sheet and V1/V2 loop deletions 



*: is the N-linked glycosylation sites for subtype C TV1 and TV2. Possible 
mutation (N-> Q) or deletions can be performed. 
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( 1 ) MDAMKRGLCCVLLLCGAVFVSP-SAVEKLWVTVY YGVPVWKEATTT 

( 1 ) MRVMGTQKNCQQWW I WG I LGFWMLMI - CNTEDLWVT V Y YG V PVWRDAKTT 

( 1 ) MRVMGTQKNCQQWW I WG I LGFWMLMI -CNTEDLWVTVYYGVPVWREAKTT 

(1) MRARGILKNYRHWWIWGILGFWMLMM-CNVKGLWVTVYYGVPVGREAKTT 

( 1 ) MRVKGIPRNWQQWWIWGSLGFWIIC--SVMGNLWVTVYYGVPVWREAKTT 

( 1 ) MRVRGTLRNYQQWWIWGVLGFWMLMICNGGGNLWVTV Y YGVPVWKEAKTT 

(1) MRVMGIQRNCQHLLTWGIMILGTIIFCSAVENLWVTVYYGVPVWRDADTT 

( 1 ) MRVREIERNYLCLWRWGIMLLGMLMTYSVAEKKWVTVYYL-VPVWKEA-TT 

( 1 ) MDAMKRGLCCVLLLCGAVFVSP-SASHNLWVTVYYGVPVWRDADTT 

(1) MRV G RN Q WWIWGILGFWMLM S E LWVTVYYGVPVWREAKTT 
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100 



(4 6) LFCASDAKAYDTEVHI V J n F i N f IWKNNMV 

(50) LFCAS DAKAYETEVH.NVWATHACVPTDPNPQE I VLGNVTEN FNMWKM DMA 

(50) LFCASDAKAYETEVHMVWATHACVPTDPNPQEIVLGNVTENFNMKKNNMA 

(50) LFCASDAKAYEKEVHHVWATHACVPTDPNPQEVILGNVTENFNKWKKDMV 
(49) LFCASDArCAYEAEVHNVWATHACVPTDPNPQEIELKMVTENFWMWENDMV 

(51) LLCASDAKAYEREVHMVWATHACVPTDPNPQEIVLGMVTENFNMKKNDMV 
(51) LFCASDAKAYETEKHNVWATHACVPTDPNPQEIHLDNVTEKFNMWKMNMV 
(51) LFCASDAKSYKTEVHMIWATHACVPTDPNPREIELENVTENFHMWKHNMV 
(46) LFCA S DftKAHETEVHNVW ATHACV PT DP'M PQH) I H LENV TEN FNMWKNNMV 
(51) LFCASDAKAYETEVHNVWATHACVPTDPNPQEIVL NVTENFNMWKNNMV 

P2/V1V2/P3 

101 | * *150 

(96) EQMHEDII.3LWDQSLKPCVKLTPLCyTLHCTNLKNATN TKSS 

( 100 ) DQMHEDVI 3LWDQ3LKPCVKLTPLCVTLNCTDTNVTGNRTVTGNSTNNTN 
(100) DQMHFhT ■ ' [ , ' y ■ v. i"T TPLCVT LNCTDTNVTGNRTVTGNTNDTNI 

(100) DQMQEDI I SLWDQSLKPCVKLTPLCVTLNCTNATVNYN NTS— 

(99) DQMHEDIISLWDQSLKPCVKLTPLCVTLNCKNVTSKDI NI 

(101) DQMHEDVI SLWDQSLKE'CVKLTPLCVTLECRNVSRNVS SY 

(101) EQMHTDI I SLWDQSLKPCVKLTPLC'yTLHCTNVTSVNT 

(101) EQMHEDIISLWDQSLKPCVKLTPLCVTLNCTDARRNET RNNIT 

(96) EQMQEDVISLWDQSLKPCVKLTPLCVTLNCTNAKLTNV NNITSVS 

(101) DQMHEDI I SLWDQSLKPCVKLTPLCVTLNCTN 
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151 * * 200 

(138) NWKEMDRGEIKNCSFKVTTSJRNKMQKEYAH' YKLDVVPIUN DNT 

(150) GTGIYNIEEMKNCSFNATTELRDKKHKEYALFYRLDIVPLN ENSDNF 

(150) ANATYKYEEMKNCSFNATTELRDKKHKEYALFYKLDIVPLN ENSNNF 

(141) KDMKNCSFYVTTELRDKKKKENALFYRLDIVPLNNR-KNGNIN 

(139) TSNAEMKAEMKMCSFNITTELRDKKKQEYALFYKLDIVPLTNDNASENAS 
(141) NTYNGSVEEIKNCSFNATPEVRDRKQRMYALFYGLDIVPLNKKNSSENSS 

(139) -TG— DREGLKNCSFNMTTELRDKRQKVYSLFYRLDIVPINE NQGS 

(144 ) GMENNDQIEMKNCSFNITTKLIDKKKQVHALFYRLDVVQIDNDTSNSNYS 

(141) NTIGNITDEVRNCSFNMTTELRDKKQKVHALFYKLDIVPIED NKTSS 

(151) T EEMKNCSFNITTELRDKK KEYALFYKLDIVPLN N N S 



Figure 2B 
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(183) SYKLIHCNTSVITQACPKVSFEPI PIHYCAPAGFAILKCNDKKFHGSGPC 

(197) TYRLINCNTSTITQACPKVSFDPIPIHYCAPAGYAILKCNNKTFMGTGPC 

(197) TYRLINCNTSTITQACPKVSFDPI PIHYCAPADYAI LKCNNKTFNGTGPC 

(183) NYRLI WCNTSAITQACPKVSFDPIFIHYCAPAGYAPLKCNNKKFBGIGPC 

(189) EYRLINCDTSTITQSCPKVTFDPIPIHYCAPAGYVILKCNNKTFHGTGPC 

(191) EYRLIMCNTSAITQACPKVTFDPIPIHYCAPA.GYAILKCNNKTFMGTGPC 

(182) EYRLINCNTSAITQACPKVSFEPIPIHYCTPAGFAILKCKDEGFHGTGLC 

(194) NYRLI NCNTSAITQACPKVTFEPI PIHYCAPAGFAI LKCRDKKFHGTGPC 

(188) EYRLI NCNTSVIKQACPKISFDPI PI HYCTPAGYAILKCNDKNFHGTGPC 

(201) YRLINCNTS ITQACPKVSFDPI PIHYCAPAGYAILKCNNK FNGTGPC 
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251 * * 300 

(233) TNVSTVQCTHGIRPVVSTQLLLHGSLAEEGWIRSENFTDNAKTIiyQLK 

(247) YNySTVQCTHGIKPVVSTQLLLHGSLAEEGIIIRSENLTEHTKTIIVHLN 

(247) YNVSTVQCTHGIKPWSTQLLLHGS LAEEGI I IRSENLTENTKTI I VHLN 

(233) DNVSTVQCTHGIKPWSTgLLLNGSLAEEEIIIRSEKLTNNVKTIIVHLN 

(239) NMVSTVQCTHGIKPVVSTQLLLNGSLAEKEI I IRSKNITDNVKT I IVHLN 

(241) NHVSTVQCTHGIKPyVSTQLLLNGSLAEGEIIIRSEMLTNNVKTIIVHLN 

(232) KWV3TVQCTHGIKPVVSTQLLLNGSLAEKKITIR3EKITNHAKIIIVQLV 

(244) KNVST'v'OCTHGIRPyySTQLLLWGSLAEEEI 1 1 RSEHLTNNAKTLIVQLN 

(238) KMVSSVQCTHGIKPyVSTQLLLNGSLAEEEII IRSEHLTNNAKTIIVHLN 

(251) NVSTVQCTHGIKPWSTQLLLNGSLAEEEIIIRSENLTNN KTIIVHLN 
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301 * * 350 

(283) ESVEINCTRPN-NNTRKSITIGPGPAFYATGDIIGDIRQAHCNISGEKWN 

(2 97) ESVEINCTRPN-NNTkKSVRIGPGQAFYATNDVIGNIRQAHCNISTDRWM 

(2 97) ESVE INCTRPN-NNTRKSVRIGPGQAFYATNDVIGHIRQAHCNISTDRWN 

(283) ESIEIKCTRPG-NNTRKSVRIGPGQAFYATGDIIGDIRQAHCNISKNEWN 

(289) ESVEIEC'TRPG-WNTRRSVRIGPGQAFYATGDIIGDIRAAHCNISESKWN 

(2 91) QSVEIVCTRPN-NNTRKSIRIGPGQTFYATGDIIGDIFiQAHCNISRDKWN 

(282) QPVTI KCIRPN-NNTRKSIRIGPGQAFYATGDIIGDIRQAHCNVTRSRWN 

(294) ESVEINCTRPYYNQIRQRTSIGQGQALYTTR-VTGDIRKAYCNISKAGWN 

(288) KSVEINCTP-PS-NNTRTSITIGPGQVFYRTGDI I GDIF.KAYCEINGTKwH 

(301) ESVEINCTRPN NNTRKSIRIGPGQAFYATGDIIGDIRQAHCNIS KWN 
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(332) NTLKQIVTKLQAQFGNKT-IVFKQSSGGDPEIVMHSFNCGGEFFYCNSTQ 

(34 6) KTLQQVMKKLGEHFPNKT-IQFKPHAGGDLEITMHSFNCRGEFFYCNTSN 

(34 6) KTLQQVMKKLGEHFPNKT-IKFEPHAGGDLEITMHSFNCRGEFFYCNTSN 

(332) TTLQRVSQKLQELFPNSTGIKFAPHSGGDLEITTHSFMCGGEFFYCNTTD 

(338) KILYRVSEKLKEHFPNKT-IQFDQPIGGDLEITTHSFNCRGEFFYCNTSK 

(340) ETLQRVGKKLAEHFHNKT-IKFASSSGGDLEITTHSFKCRGEFFYCNTSG 

(331) KTLQEVAEKLRTYFGNKT-IIFANSSGGDLEITTHSFNCGGEFFYCNTSG 

(343) KTLQQVAKKLGDLFNiaTT-IIFKPSSGGDPEITTHSFMCGGEFFYCNTSK 

(337) EVLTQVTEKLKEHFNNKT-IIFQPPSGGDLEITMHHFNCRGEFFYCNTTR 

(351) KTLQQV KL EHF NKT I F P SGGDLEITTHSFNCRGEFFYCNTS 



Figure 2C 
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401 I 020/021 ^50 

(381) LFNSTWNN TIGPNN — TNGTITLPCRIKQIINRWQEVGKAMYAPP 

(395) LFNSTYHS NNGTYKYNGNSSSPITLQCKIKQIVRMWQGVGQATYAPP 

(395) LFMSTYYP KNGTYKYNGNSSLPITLQCKIKQIVRMWQGVGQAMYAPP 

(382) LFNSTYSNGTCTNGTCMSN-- NTERITLQCRIKQIINMWQEVGRAMYAPP 

(387) LFNGTYNS TGDTSN STITLSCRIKQI INMWQGVGRAMYASP 

(389) LFNGTYMPTYMPNGTESNS NSTITI PCRIKQI INMWQEVGRAMYAPP 

(380) LFWSTWYVNSTWNDTDSTQ-ESNDTITLPCRIKQIINMWQRAGQAMYAPP 
(392) LFMSAWND-STWNIGNNNTGSDNETI IIPCRI K.QI INMWQGVGKAMYAP? 

(386) LFNHTCIE NGTMGGC NGTI I LPCK J K Q 1 1 NMWQGAGQAM YAPP 

(401) LFNSTY NGT N N TITLPCRIKQIINMWQGVGRAMYAPP 
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451* * * 500 

(424) IRGQIRCSSNITGLLLTRDGGKEISNT TEIFRPGGGDMRDNWRSELY 

(442) IAGNITCR3NITGILLTRDGGFNTTNN— -TETFRPGGGDMRDMWRSELY 

(442) IAGNITCRSNITGILLTRDGGFNNTNNDT-EETFRPGGGDMRDNWP.SELY 

(430) IAGNITCRSHITGLLLTRDGGDNNTET ETFRPGGGDMRDNWRSELY 

(428) IAGNITCK5NITGLLLTRDGGNETS.GI E I FR P AGG DMRDNWR S E L Y 

(436) IAGNITCTSNITGLLLVHDGGIKEWDTENKTEIFRPGGGDMRDNWRSELY 

(429) IPGVl'KCESNITGLLLTRDGGKDNNVN ETFRPGGGDMRDI-J K RS E L Y 

(441) IEGWINCASNITGLLLVRDGGGANDSQ NETFRPQGGDMRDMWRSELY 

(429) ISGRIHCVSNITG1LLTRDGGAINTTN ETFRPGGGNIKDNWRSELY 

(451) IAGNITC SNITGLLLTRDGG NT N ETFRPGGGDMRDNWRSELY 
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501 550 

(471) K Y K WK I E PL G VA P T KAK P. R VVQ RE KRAVT L GAMFL G F LG AA G5 TMG AR S 

(489) KYKVVEIKPLGIAPTKAKRRVVQREKRAVGIGAVFLGFLGAAGSTMGAAS 

(491) KYKVVEIKPLGIAPTKAKRRWQRKKFiAVGI GAVFLGFLGAAGSTMGAAS 

(476) KYKVVEIKPLGVAPTAAKP.RWEREKRAVGI GAVFLGFLGAAGSTMGAAS 

(474) KYKVVEIKPLGLAPTKS.KRRWEREKRAVTFGAMFLGFLGAAGSTMGAAS 
(486) KYKWEIKPLGVAPTAAKRRWEREKRAVGIGAVFLGFLGAAGSTMGAAS 

(475) KY KV VE I E P LGVA PTRAKRRW EREKRA VG I GAVFLGF LGAA GS TMGAT S 
(488) KYKVVKIEPLGiAPTKAKRRWEREKRAIGLGAMFLGFLGAAGSTMGAAS 
(475) KYKWQIEPLGIAPTRAKRRWEREKRAVGIGAMIFGFLGAAGSTMGAAS 
(501) KYKWE I KPLGIAPTKAKRR WE RE KRAVGI GAVFLGFLGAAGSTMGAAS 
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551 



600 



(521) LTLTVQARQLLSGIVQQQNHLLRAIEAQQHLLQLTVWGIKQLQARVLAVE 

(539) IT LTVQRRQ LLSG I VQQQSNLLKAIEAQQ HMLQ LTVWG I KQLQARV LAI E 

(541) ITLTVQARQLLSGIVQQQSNLLKAIEAQQHMLQLTVWGIKQLQARVLATE 

(526) ITLTVQARQLLSGI VQQQSHLLRAIEAQQHMLQLTVWGIKQLQARVLAI'E 

(524) MTLTVQARQLLSGIVQQQSNLLRAIEAQQHMLQLTVWGIKQLQTRVLAVE 
(536) ITLTAQARQLLSGIVQQQSNLLRAI EAQQHLLQLTVWGIKQLQTRVLATE 

(525) ITLTVQAP.QLLSGIVQQQNMLLRAIEAQQHLLKLTVWGIKQLQARVLAVE 
(538) LTLTVQARQLLSGIVQHQKNLLMAIEAQQHLLQLTVWGIKQLQARILAVE 
(525) ITLTVQARQLLSGI VQQQSNLLRA. I EAQQHLLQLTVWGIKQLQARVLAVE 
(551) ITLTVQARQLLSGIVQQQSNLLRAIEAQQHLLQLTVWGIKQLQARVLAVE 



Figure 2D 



B-SF162 
C-TV1.8_2 
C-TV1.8_5 
C-TV2 . 12-5/1 
C-MJ4 
IndiaC-93IN101 
A-Q2317 
D-92UG001 
E-cm235 
Consensus 



601 * 650 

(571) RYLKDQQLLGIWGCSGKLICTTAVPWNASWSNKSLDQIWNNMTWMEWERE 

(589) RYLKDQQLLGIWGCSGRLICTTAVPWHSSWSNKSEKDIWDWMTWMQWDRE 

(591) RYLKDQQLLGIWGCSGRLICTTAVPWKSSWSNKSEADIWDNMTWMQWDRS 

(57 6) RYLQDQQLLGLWGCSGKLICTTNVLWNSSWSNKTQSDIWDNMTWMQWDRE 

(574) FiYLRDQQLLGIWGCSGKLICTTAVPWHSSWSNKSQHDIWDNLTWMQWDRE 
(586) RYLKDQQLLGIWGCSGKLICTTAVPWNSSWSNKT'QSEIWNNMTWMQWDRE 

(575) RYLRDQQLLGIWGCSGKLICTTNVPWHSSWSNKSLDjEIWNHMTWLQWDKE 
(588) RYLQDQQLLGSWGCSGRHICTTTVPWNSSWSNKSIDDIWNNMTWMEWEKE 
(575) RYLKDQKFLGLWGCSGKIICTTAVPWKSTWSNRSYEEIWNNMTWIEWERE 
(601) RYLKDQQLLGIWGCSGKLICTTAVPWNSSWSNKS DIWNNMTWMQWDRE 
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700 



(621) IDNYTNLIYTLIEE3QNQQEKNEQELLELDKWA5LKNWFDISKWLWYIKI 

(639) ISNYTGLIYNLLEDSQNQQEKHEKDLLELDKWNNLVTOWFDISNWPWYIKI 

( 641 ) INNYTETIFRLLEDSQNQQEKNEKDLLELDKWNNLWNWFDISNWLWY I KI 

(626) ISNYTNTIYRLLEDSQSQQERNEKDLLALDRWNNLWNWFSltNWLWYIKI 

(624) ISNYTDTIYRLLEESQNQQERNEKDLLALDSWKTLWSWFDISNWLWYIKI 
(636) VSNYTNIIYSLLEE3QNQQEKNEKDLLALDSWKNLWSWFDI.TNWLWYIKI 

(625) INHYTQLIYRLIEESQNQQEKNEKELLELDKWANLVJSWFDISNWLWYIKI 
(638) IDNYTGVIYRLIEESQTQQEKNEQELLQLDKWASLWNWFSITKWLWYIKI 
(625) ISNYTNQIYEILTESQNQQDRNEKDLLELDKWASLWNWFDITKWLWYIKI 
(651) ISNYTNLIYRLLEESQNQQEKNEKDLLELDKW NLWNWFDISNWLWYIKI 
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(671) M " [V I r-QTRFPAPRGP PEG 

(689) FIMIVGGLIGLRI I FA VLSI VHP. VP.QGY3 PLSFQTLTPSPRGLDP.LGGIE 

(691) FIMIVGGLIGLRI IFAVLS I VNRVRQGYSPLSFQTLTPSPRGLDRLGGIE 

(676) FIMIVGGLIGLRI IFAVLSLVNRVRQGiSPLSLQTLIPNPRGPDRLGGIE 

(674) FIMIVGSLIGLRI IFAVLS I VNRVRQGYSPLSFQTLTPNPRGPDRLEGIE 
(686) FIMIVGGLIGLRI I FA VLSI VNRVRQGYS PL3FQTLTPNPRGPDRLGRIE 

(675) FIIIVGGLIGLRIVFAVLSVINRVRQGYSPLSFQTHTPNPRGLDRPERIE 
(688) FIMIVGGLIGLRIVFTVLSLVNRVRQGYSPLSFQTLFPAPRGPDRPEEIE 
(675) FIMIIGGLIGLRI IFAVLS I VNRVP.QGYSPLSFQTPFHHQREPDRSERIE 
(701) FIMIVGGLIGLRI I FAVLSIVNRVRQGYSPLSFQTLTP PRGPDRLEGIE 
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751 800 

(721) EEGGERDRDRSSPLVHGLLALIWDDLRSLCLFSYHRLRDLILIAARIVEL 

(739) EEGGEQDRDRSIRLVSGFLSLAWDDLPHLCLFSYHRLRDFILIAVRAVEi, 

(741) EEGGEQDRDRSIRLVSGFLSLAWDDLRSLCLFSYHRLRDFILIAVRAVEL 

(726) EEGGEQDSSRSIRLVSGFLTLAWDDLRSLCLFCYHRLRDFILIWRAVEL 

( 724 ) EEGGEQDKDRSIRLVNGSLALAWDDLRSLCLFS YHQLRDFILVVARAVEL 
(736) EEGGEQDKDRSIRLVNGFLALAWDDLRNLCLFSYHRLRDFISVAARWEL 

(725) EEDGEQGRGRSIRLVSGFLALAWDDLRSLCLFSYHRLRDFILIAARTVEL 
(738) EGGGEQGRGRSTRLVNGFSTLIWDDLRNLCLFSYHRLRDLILIATRIVEL 
(725) EGGGEQGRDRSVRLVSGFLALAWDDLRSLCLFSYHRLRDFILIAARTVKL 
(751) EEGGEQDRDRSIRLVSGFLALAWDDLRSLCLFSYHRLRDFILIAAR VEL 



Figure 2E 
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801 850 

(771) LGR RGWEALKYWGNLLQYWIQELKNSAVSLFDAIAIAVAEGTD 

(789) LGHSSLRGLQRGWEILKYLGSLVQYWGLELKKSAISLLDTIAITVAEGTD 

(791) LGHSSLRGLQRGWEILKYLGSLVQYWGLELKKSAISPLDTIAIAVAEGTD 

(776) LGHSSLRGLQRGWGTLKYLGSLVQYWGLELfCKSAINLLDTIAIAVAEGTD 

(774) LGRSSLRGLQRGKEALKYLGSLIQYWGLELKRSTISLLDTVAIAVAEGTD 
(786) LGRSS WEALKYLGSLVQYKGLELKKSAI SLFDSIAIWAEGTD 

(775) LGHSSLKGLRLGWEGIKYLWNLLS YKGRELKI S A I N LV DT I A I A V AGWT D 

(788) LGR RGWEAIKYLWNLLQYW8QE LKTSAI SLFN ATAVAVAEGTD 

(775) LGRSS LKGLRRGWEGLKYLGNLLLYRGQELKI 3AISLLDATAI I VAGWTD 

(801) LGRSSLRGL RGWEALKYLGSLLQYWGLELKKSAISLLDTIAIAVAEGTD 
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E-cm235 
Consensus 



851 879 

(814) RIIEVAQRIGRAFLHIPRRIRQGFERALL 

(839) RI I ELVQRI CRAI LN I PRRI RQGFEAALL 

(841) R 1 1 ELVQRI C RA I LN I PRRI RQG FEAALL 

(826) R I LEFI QNLCRG IRNVPRRIRQGFEAALQ 

(824) RI I ELIQRIWRAICN I PRRI RQGFEAALL 
(829) RI I ELVQGFCRAI RN I PTRI RQGFEAALQ 

(825) R'VI E I AQRI GRAI LH I PVRI RQGLERALL 
(831) RVIEWQRFFRGILNVPTRIRQGLERALL 
(825) RVIEVAQGAWRAILHIFRRIRQGLERTLL 
(851) RIIELVQRI RAILN I PRRI RQGFEAALL 
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gpl40 .modSF162 .delV2 

gaattcgccaccatggatgcaatgaagagagggctctgctgtgtgctgctgctgtgtggagcagtc 
CCcgtttcgcccagcgccgtggagaagctgtgggtgaccgtgtaccacggcgtgcccgtgtggaag 
gaggccaccaccaccctgttccgcgccagcgacgccaaggcctacgacaccgaggtgcacaacgtg 
tgggccacccacgcctgcgtgcccaccgaccccaacccccaggagatcgtgctggagaacgtgacc 
gagaacttcaacatgcggaagaacaacatggtggagcagatgcacgaggacatcatcagcctgtgg 
gaccagagcctgaagccctgcgtgaagctgacccccctgcgcgtgaccccgcactgcaccaacctg 
aagaacgccaccaacaccaagagcagcaactggaaggagacggaccgcggcgagatcaagaaccgc 
agcttcaagatgggcgccggcaagctgaccaactgcaacaccagcgcgaccacccaggcctgcccc 
aaggcgagcctcgagcccacccccatccactactgcgcccccgccggctccgccatcccgaagtgc 
aacgacaaaaagcccaacggcagcggcccctgcaccaacgtgagcaccgcgcagcgcacccacggc 
atccgccccctagtgagcacccagctgctgctgaacggcagcctggccgaggagggcgtggtgatc 
cgcagcgagaacttcaccgacaacgccaagaccaccancgcgcagccgaaggagagcgcggagatc 
aactgcacccgccccaacaacaacacccgcaagagcatcaccatcggccccggccgcgccttctac 
gccaccggcaacatcatcggcgacatccgccaggcccactgcaacatcagcggcgagaagtggaac 
aacaccctgaagcaqatcgtgaccaagctgcaggcccagttcggcaacaagaccatcgcgttcaag 
cagagcagcagcggcgaccccgagatcgtgatgcacagcttcaaccgcggcggcgagctcttccac 
tgcaacagcacccagctgttcaacagcacctggaacaacaccatcggccccaacaacaccaacggc 
accatcaccctgccccgccgcatcaagcagaccatcaaccgccggcaggaggtgggcaaggccatg 
tacgccccccccatccgcggccagatccgctgcagcagcaacaccaccggcctgccgctgacccgc 
ggcggcaaa g a g atC agcaacaccaccgagatctt:ccgccccggcggcggcgacatgcgcgac 
aactggcgcaacgagctgtacaagcacaaggtggcgaagatcgagcccccgggcgtggcccccacc 
aaggccaaacgccgcgcggtacagcgcgagaagcgcgccgcgaccccgggcgccatgttcctgggc 
ticcccgggcaccgccggcagcaccacgggcgcccgcagcccgaccccgaccgtgcaggcccgccag 
ctgccgagcggcatcgtgcagcagcagaacaacctgctgcgcgccaccgaggcccagcagcacctg 
ccgcagctaaccgtgtggggcatcaagcagctgcaggcccgcgcgcccgccgtggagcgctacctg 
aaggaccagcagctgctgggcatctggggctgcagcggcaagctgatccgcaccaccgccgtgccc 
cggaacaccagctggagcaacaagagcctggaccagatctggaacaacatgar.ctggacggagtgg 
aagcgcgaaatcgacaactacaccaacctgatctacaccctgatcgaggagagccagaaccagcag 
gagaagaacgagcaggagccgctggagctggacaagtgggccagcccccggaactggttcgacatc 
agcaagtggctgtggtacatctaactcgag 



Figure 6 



gpl40.mut7 .modSF162 . delV2 



gaattcgccaccatggatgcaatgaagagagggctctgctgtgtgctgccgctgtgtggagcagtc 
ttcgtttcgcccagcgccgtggagaagctgtgggcgaccgtgtactacggcgtgcccgtgtggaag 
gaggccaccaccaccctgttctgcgccagcgacgccaaggcctacgacaccgaggtgcacaacgtg 
tgggccacccacgcctgcgtgcccaccgaccccaaeccccaggagatcgtgctggagaacgtgacc 
gagaacttcaacatgtggaagaacaacatggtggagcagacgcacgaggacatcatcagcctgtgg 
gaccagagcctgaagccccgcgtgaagctgacccccctgtgcgtgaccctgcactgcaccaacctg 
aagaacgccaccaacaccaagagcagcaactggaaggagacggaccgcggcgagatcaagaactgc 
agctccaaggtgggcgccggcaagctgatcaactgcaacaccagcgtgatcacccaggcctgcccc 
aaggtgagcctcgagcccatccccatccaccactgcgcccccgccggctccgccaccctgaagtgc 
aacgacaagaagttcaacggcagcggcccctgcaccaacgcgagcaccgcgcagcgcacccacggc 
acccgccccgtggtgagcacccagctgctgctgaacggcagcctggccgaggagggcgtggtgatc 
cgcagcgagaacttcaccgacaacgccaagaccaccatcgtgcagctgaaggagagcgtggagatc 
aactgcacccgccccaacaacaacacccgcaagagcatcaccatcggccccggccgcgccttctac 
gccaccggcgacatcatcggcgacatccgccaggcccactgcaacatcagcggcgagaagtggaac 
aacaccctgaagcagatcgtgaccaagctgcaggcccagttcggcaacaagaccatcgtgttcaag 
cagagcagcggcggcgaccccgagatcgtgatgcacagcttcaactgcggcggcgagttcttctac 
tgcaacagcacccagctgtccaacagcacctggaacaacaccatcggccccaacaacaccaacggc 
accatcaccctgccctgccgcatcaagcagatcatcaaccgctggcaggaggtgggcaaggccatg 
tacgccccccccatccgcggccagatccgctgcagcagcaacatcaccggcctgctgctgacccgc 
gacggcggcaaggagatcagcaacaccaccgagaccccccgccccggcggcggcgacatgcgcgac 
aactggcgcagcgagctgcacaagtacaaggtgg^gaagatcgagcccctgggcgcggcccccacc 
aaagccaccagcagcgtggcgcagagcgagaagagcgccg^gaccctgggcgccatgttcctgggc 
ttcccaggcgccgccggcagcaccacgggcgcccccagcccgaccctgaccgtgcaggcccgccag 
ctgctgagcggcaccgcgcagcagcagaacaacccgctgcgcgccaccgaggcccagcagcacctg 
ctgcagctgaccgtgtggggcatcaagcagctgcaggcccgcgtgctggccgtggagcgctacctg 
aaggaccagcagctgctgggcatctggggctgcagcggcaagctgatccgcaccaccgccgtgccc 
tggaacgccagctggagcaacaagagcctggaccagacccggaacaacatgacccggatggagtgg 
gagcgcgagatcgacaactacaccaacctgatccacaccctgatcgaggagagccagaaccagcag 
gagaagaacgagcaggagctgctggagctggacaagtgggccagcctgtggaaccggttcgacatc 
agcaagtggctgtggtacatctaactcgag 



Figure 7 



9P 14 Omod . TVl . delV2 



1 gaattcatgc gcgtgatggg cacccagaag 
61 ctgggcttct ggatgctgat gatctgcaac 
121 ggcgtgcccg tgtggcgcga cgccaagacc 
181 tacgagaccg aggtgcacaa cgtgtgggcc 
241 ccccaggaga tcgtgctggg caacgtgacc 
301 gccgaccaga tgcacgagga cgtgatcagc 
361 aagctgaccc ccotgtgcgt gaccctgaac 
421 accgtgaccg gcaacagcac caacaacacc 
481 atgaagaact gcagcttcaa cgccggcgcc 
541 atcacccagg cctgccccaa ggtgagcttc 
601 gccggctacg ccatcctgaa gtgcaacaac 
661 aacgtgagca ccgtgcagtg cacccacggc 
721 ctgaacggca gcctggccga ggagggcatc 
781 accaagacca tcatcgtgca cctgaacgag 
841 aacaacaccc gcaagagcgt gcgcatcggc 
901 gtgatcggca acatccgcca ggcccactgc 
961 ctgcagcagg tgatgaagaa gctgggcgag 
1021 ccccacgccg gcggcgacct ggagatcacc 
1081 ttctactgca acaccagcaa cctgttcaac 
1141 aagtacaacg gcaacagcag cagccccatc 
1201 ogcatgtggc agggcgtggg ccaggccaoc 
1261 tgccgcagca acatcaccgg catcctgctg 
1321 aacaccgaga ccttccgccc cggcggcggc 
13 81 tacaagtaca aggtggtgga gatcaagccc 
1441 cgcgtggtgc agcgcgagaa gcgcgccgtg 
1501 ggcgccgccg gcagcaccat gggcgccgcc 
1561 ctgctgagcg gcatcgtgca gcagcagagc 
1621 cacatgctgc agctgaccgt gtggggcatc 
1681 gagcgctacc tgaaggacca gcagctgctg 
1741 tgcaccaccg ccgtgccctg gaacagcagc 
1801 gacaacatga cctggatgca gtgggaccgc 
1861 aacctgctgg aggacagcca gaaccagcag 
1921 gacaagtgga acaacctgtg gaactggttc 
1981 ctcgag 



aactgccagc agtggtggat ctggggcatc 
accgaggacc tgtgggtgac cgtgtactac 
accctgttct gcgcoagcga cgccaaggcc 
acccacgcct gcgtgcccac cgaccccaac 
gagaacttca acatgtggaa gaacgacatg 
ctgtgggacc agagcctgaa gccctgcgtg 
tgcaccgaca ccaacgtgac cggcaaccgc 
aacggcaccg gcatctacaa catcgaggag 
ggccgcctga tcaactgcaa caccagcacc* 
gaccccatcc ccatccacta ctgcgccccc 
aagaccttca acggcaccgg cccctgctac 
atcaagcecg tggtgagcac ccagctgctg 
atcatccgca gcgagaacct gaccgagaac 
agcgtggaga tcaactgcac ccgccccaac 
cccggccagg ccttctacgc caccaacgac 
aacatcagca ccgaccgctg gaacaagacc 
cacttcccca acaagaccat ccagttcaag 
atgcacagct tcaactgccg cggcgagttc 
agcacctacc acagcaacaa cggcacctac 
accctgcagt gcaagatcaa gcagatcgtg 
tacgcccccc ccatcgccgg caacatcacc 
acccgcgacg gcggcttcaa caccaccaac 
gaoatgcgcg acaactggcg cagcgagctg 
ctgggcatcg cccccaccaa ggccaagcgc 
ggcatcggcg ccgtgttcct gggcttcctg 
agcatcaccc tgaccgtgca ggcccgccag 
aacctgctga aggccatcga ggcccagcag 
aagcagctgc aggcccgcgt gctggccatc 
ggcatctggg gctgcagcgg ccgcctgatc 
tggagcaaca agagcgagaa ggacatctgg 
gagatcagca actacaccgg cctgatctac 
gagaagaacg agaaggacct gctggagctg 
gacatcagca actggccctg gtacatctaa 



Figure 8 



gpl40mod.TVl .ftiut7.delV2 



1 gaattcatgc gcgtgatggg cacccagaag aactgccagc agtggtggat ctggggcatc 
61 ctgggcttct ggatgctgat gatctgcaac accgaggacc tgtgggtgac cgtgtactac 
121 ggcgtgcccg tgtggcgcga cgccaagacc accctgttct gcgccagcga cgccaaggcc 
181 tacgagaccg aggtgcacaa cgtgtgggcc acccacgcct gcgtgcccac cgaccccaac 
241 ccccaggaga tcgtgctggg caacgtgacc gagaacttca acatgtggaa gaacgacatg 
301 gccgaccaga tgcacgagga cgtgatcagc ctgtgggacc agagcctgaa gccctgcgtg 
361 aagctgaccc ccctgtgcgt gaccctgaac tgcaccgaca ccaacgtgac cggcaaccgc 
421 accgtgaccg gcaacagcac caacaacacc aacggcaccg gcatctacaa catcgaggag 
481 atgaagaact gcagcttcaa cgccggcgcc ggccgcctga tcaactgcaa caccagcacc 
541 atcacccagg cctgccccaa ggtgagcttc gaccccatcc ccatccacta ctgcgccccc 
601 gccggctacg ccatcctgaa gtgcaacaac aagaccttca acggcaccgg cccctgctac 
661 aacgtgagca ccgtgcagtg cacccacggc atcaagcccg tggtgagcac ccagctgctg 
721 ctgaacggca gcctggccga ggagggcatc atcatccgca gcgagaacct gaccgagaac 
781 accaagacca tcatcgtgca cctgaacgag agcgtggaga tcaactgcac ccgccccaac 
841 aacaacaccc gcaagagcgt gcgcatcggc cccggccagg ccttctacgc caccaacgac 
901 gtgatcggca acatccgcca ggcccactgc aacatcagca ccgaccgctg gaacaagacc 
961 ctgcagcagg tgatgaagaa gctgggcgag cacttcccca acaagaccat ccagttcaag 
1021 ccccacgccg gcggcgacct ggagatcacc atgcacagct tcaactgccg cggcgagttc 
1081 ttctactgca acaccagcaa cctgttcaac agcacctacc acagcaacaa cggcacctac 
1141 aagtacaacg gcaacagcag cagccccatc accctgcagt gcaagatcaa gcagatcgtg 
1201 cgcatgtggc agggcgtggg ccaggccacc tacgcccccc ccatcgccgg caacatcacc 
1261 tgccgcagca acatcaccgg catcctgctg acccgcgacg gcggcttcaa caccaccaac 
1321 aacaccgaga ccttccgccc cggcggcggc gacatgcgcg acaactggcg cagcgagctg 
1381 tacaagtaca aggtggtgga gatcaagccc ctgggcatcg cccccaccaa ggccatcagc 
1441 agcgtggtgc agagcgagaa gagcgccgtg ggcatcggcg ccgtgttcct gggcttcctg 
1501 ggcgccgccg gcagcaccat gggcgccgcc agcatcaccc tgaccgtgca ggcccgccag 
1561 ctgctgagcg gcatcgtgca gcagcagagc aacctgctga aggccatcga ggcccagcag 
1621 cacatgctgc agctgaccgt gtggggcatc aagcagctgc aggcccgcgt gctggccatc 
1681 gagcgctacc tgaaggacca gcagctgctg ggcatctggg gctgcagcgg ccgcctgatc 
1741 tgcaccaccg ccgtgccctg gaacagcagc tggagcaaca agagcgagaa ggacatctgg 
1801 gacaacatga cctggatgca gtgggaccgc gagatcagca actacaccgg cctgatctac 
1861 aacctgctgg aggacagcca gaaccagcag gagaagaacg agaaggacct gctggagctg 
1921 gacaagtgga acaacctgtg gaactggttc gacatcagca actggccctg gtacatctaa 
1981 ctcgag 



Figure 9 



FIGURE 10 

gpl60mod.Q23-17 



1 ATGCGCGTGA TGGGCATCCA GCGCAACTGC CAGCACCTGC TGACCTGGGG CATCATGATC 

61 CTGGGCACCA TCATCTTCTG CAGCGCCGTG GAGAACCTGT GGGTGACCGT GTACTACGGC 

121 GTGCCCGTGT GGCGCGACGC CGACACCACC CTGTTCTGCG CCAGCGACGC CAAGGCCTAC 

181 GAGACCGAGA AGCACAACGT GTGGGCCACC CACGCCTGCG TGCCCACCGA CCCCAACCCC 

241 CAGGAGATCC ACCTGGACAA CGTGACCGAG AAGTTCAACA TGTGGAAGAA CAACATGGTG 

301 GAGCAGATGC ACACCGACAT CATCAGCCTG TGGGACCAGA GCCTGAAGCC CTGCGTGAAG 

361 CTGACCCCCC TGTGCGTGAC CCTGCACTGC ACCAACGTGA CCAGCGTGAA CACCACCGGC 

421 GACCGCGAGG GCCTGAAGAA CTGCAGCTTC AACATGACCA CCGAGCTGCG CGACAAGCGC 

481 CAGAAGGTGT ACAGCCTGTT CTACCGCCTG GACATCGTGC CCATCAACGA GAACCAGGGC 

541 AGCGAGTACC GCCTGATCAA CTGCAACACC AGCGCCATCA CCCAGGCCTG CCCCAAGGTG 

601 AGCTTCGAGC CCATCCCCAT CCACTACTGC ACCCCCGCCG GCTTCGCCAT CCTGAAGTGC 

661 AAGGACGAGG GCTTCAACGG CACCGGCCTG TGCAAGAACG TGAGCACCGT GCAGTGCACC 

721 CACGGCATCA AGCCCGTGGT GAGCACCCAG CTGCTGCTGA ACGGCAGCCT GGCCGAGAAG 

781 AACATCACCA TCCGCAGCGA GAACATCACC AACAACGCCA AGATCATCAT CGTGCAGCTG 

841 GTGCAGCCCG TGACCATCAA GTGCATCCGC CCCAACAACA ACACCCGCAA GAGCATCCGC 

901 ATCGGCCCCG GCCAGGCCTT CTACGCCACC GGCGACATCA TCGGCGACAT CCGCCAGGCC 

961 CACTGCAACG TGACCCGCAG CCGCTGGAAC AAGACCCTGC AGGAGGTGGC CGAGAAGCTG 

1021 CGCACCTACT TCGGCAACAA GACCATCATC TTCGCCAACA GCAGCGGCGG CGACCTGGAG 

1081 ATCACCACCC ACAGCTTCAA CTGCGGCGGC GAGTTCTTCT ACTGCAACAC CAGCGGCCTG 

1141 TTCAACAGCA CCTGGTACGT GAACAGCACC TGGAACGACA CCGACAGCAC CCAGGAGAGC 

1201 AACGACACCA TCACCCTGCC CTGCCGCATC AAGCAGATCA TCAACATGTG GCAGCGCGCC 

1261 GGCCAGGCCA TGTACGCCCC CCCCATCCCC GGCGTGATCA AGTGCGAGAG CAACATCACC 

1321 GGCCTGCTGC TGACCCGCGA CGGCGGCAAG GACAACAACG TGAACGAGAC CTTCCGCCCC 

1381 GGCGGCGGCG ACATGCGCGA CAACTGGCGC AGCGAGCTGT ACAAGTACAA GGTGGTGGAG 

1441 ATCGAGCCCC TGGGCGTGGC CCCCACCCGC GCCAAGCGCC GCGTGGTGGA GCGCGAGAAG 

1501 CGCGCCGTGG GCATCGGCGC CGTGTTCCTG GGCTTCCTGG GCGCCGCCGG CAGCACCATG 

1561 GGCGCCACCA GCATCACCCT GACCGTGCAG GCCCGCCAGC TGCTGAGCGG CATCGTGCAG 

1621 CAGCAGAACA ACCTGCTGCG CGCCATCGAG GCCCAGCAGC ACCTGCTGAA GCTGACCGTG 

1681 TGGGGCATCA AGCAGCTGCA GGCCCGCGTG CTGGCCGTGG AGCGCTACCT GCGCGACCAG 

17 41 CAGCTGCTGG GCATCTGGGG CTGCAGCGGC AAGCTGATCT GCACCACCAA CGTGCCCTGG 

1801 AACAGCAGCT GGAGCAACAA GAGCCTGGAC GAGATCTGGA ACAACATGAC CTGGCTGCAG 

1861 TGGGACAAGG AGATCAACAA CTACACCCAG CTGATCTACC GCCTGATCGA GGAGAGCCAG 

1921 AACCAGCAGG AGAAGAACGA GAAGGAGCTG CTGGAGCTGG ACAAGTGGGC CAACCTGTGG 

1981 AGCTGGTTCG ACATCAGCAA CTGGCTGTGG TACATCAAGA TCTTCATCAT CATCGTGGGC 

2041 GGCCTGATCG GCCTGCGCAT CGTGTTCGCC GTGCTGAGCG TGATCAACCG CGTGCGCCAG 

2101 GGCTACAGCC CCCTGAGCTT CCAGACCCAC ACCCCCAACC CCCGCGGCCT GGACCGCCCC 

2161 GAGCGCATCG AGGAGGAGGA CGGCGAGCAG GGCCGCGGCC GCAGCATCCG CCTGGTGAGC 

2221 GGCTTCCTGG CCCTGGCCTG GGACGACCTG CGCAGCCTGT GCCTGTTCAG CTACCACCGC 

2281 CTGCGCGACT TCATCCTGAT CGCCGCCCGC ACCGTGGAGC TGCTGGGCCA CAGCAGCCTG 

2341 AAGGGCCTGC GCCTGGGCTG GGAGGGCATC AAGTACCTGT GGAACCTGCT GAGCTACTGG 

24 01 GGCCGCGAGC TGAAGATCAG CGCCATCAAC CTGGTGGACA CCATCGCCAT CGCCGTGGCC 

24 61 GGCTGGACCG ACCGCGTGAT CGAGATCGCC CAGCGCATCG GCCGCGCCAT CCTGCACATC 

2521 CCCGTGCGCA TCCGCCAGGG CCTGGAGCGC GCCCTGCTGT AA 



FIGURE 11 

gpl60mod.98UA0116 



1 ATGAAGGCCC GCGGCATGCA GCGCAACTAC CAGCACCTGT GGCGCTGGGG CACCATGCTG 
61 TTCTGGATGA TCATCATGTG CAAGGCCGCC GAGAACCTGT GGGTGACCGT GTACTACGGC 
121 GTGCCCGTGT GGCGCGACGC CGAGACCACC CTGTTCTGCG CCAGCGACGC CAAGGCCTAC 
181 GACAAGGAGG TGCACAACGT GTGGGCCACC CACGCCTGCG TGCCCACCGA CCCCGACCCC 
241 CAGGAGATCA TCCTGGAGAA CGTGACCGAG AAGTTCAACA TGTGGAAGAA CAACATGGTG 
301 GAGCAGATGC AGACCGACAT CATCAGCCTG TGGGACCAGA GCCTGAAGCC CTGCGTGAAG 
361 CTGACCCCCC TGTGCGTGAC CCTGAACTGC GCCGGCCCCA GCAGCAACAA CAGCAACGTG 
421 AACAGCAACA GCAACGACAA CTGGAGCGAG GAGATGAAGA ACTGCAGCTT CAACATGACC 
481 ACCGAGCTGC GCGACAAGCG CAAGACCGTG CACAGCCTGT TCTACAAGCT GGACATCGTG 
541 AGCACCGGCA GCAACGACAG CCGCCAGTAC CGCCTGATCA ACTGCAACAC CAGCGCCATG 
601 ACCCAGGCCT GCCCCAAGGT GACCTTCGAG CCCATCCCCA TCCACTACTG CGCCCCCGCC 
661 GGCTTCGCCA TCCTGAAGTG CAAGGACACC AACTTCACCG GCACCGGCCC CTGCAAGGAC 
721 GTGAGCACCG TGCAGTGCAC CCACGGCACC AAGCCCGTGG TGAGCACCCA GCTGCTGCTG 
781 AACGGCAGCC TGGCCGAGAA GGAGGTGATG ATCCGCAGCG AGAACATCAC CGACAACGGC 
841 AAGATCATCA TCGTGCAGCT GACCGAGCCC GTGAACATCA CCCGCATCCG CCCCGGCGAG 
901 AACAAGCGCA CCAGCATCCG CATCGGCCCC GGCCAGACCT TCTACGCCAC CGGCGACGTG 
961 ATCGGCGACA TCCGCAAGGC CTACTGCAAC GTGAGCCGCG CCGCCTGGAA CAGCACCCTG 
1021 CAGAAGATCA GCACCCAGCT GCGCCAGTAC TTCAACAACA AGACCATCAT CTTCAAGAAC 
1081 AGCAGCGGCG GCGACCTGGA GGTGACCACC CACAGCTTCA ACTGCGGCGG CGAGTTCTTC 
1141 TACTGCAACA CCACCGACCT GTTCAACAGC ACCTGGAACG AGCACGGCCC CGTGACCAAC 
1201 AGCACCATGG CCAACGGCAC CATCACCCTG CCCTGCCGCA TCAAGCAGAT CATCAACATG 
1261 TGGCAGCGCG TGGGCCAGGC CATGTACGCC CCCCCCATCG AGGGCAACAT CCGCTGCGAG 
1321 AGCAACATCA CCGGCCTGCT GCTGACCCGC GACGGCGGCA GCGGCGCCAA CAGCAGCAAG 
1381 GAGACCTACC GCCCCATCGG CGGCAACATG CGCGACAACT GGCGCAGCGA GCTGTACAAG 
1441 TACAAGGTGG TGAAGATCGA GCCCATCGGC GTGGCCCCCA CCAAGGCCAA GCGCCGCGTG 
1501 GTGGAGCGCG AGAAGCGCGC CATCGGCCTG GGCGCCGCCT TCCTGGGCTT CCTGGGCGCC 
1561 GCCGGCAGCA CCATGGGCGC CGCCAGCATG ACCCTGACCG TGCAGGCCCG CCAGCTGCTG 
1621 AGCGGCATCG TGCAGCAGCA GAGCAACCTG CTGCGCGCCA TCGAGGCCCA GCAGCACCTG 
1681 CTGAAGCTGA CCGTGTGGGG CATCAAGCAG CTGCAGGCCC GCGTGCTGGC CGTGGAGCGC 
1741 TACCTGAAGG ACCAGCAGCT GCTGGGCATC TGGGGCTGCA GCGGCAAGCT GATCTGCACC 
1801 ACCAACGTGC CCTGGAACAG CAGCTGGAGC AACAAGAGCC AGAGCGAGAT CTGGGGCAAC 
1861 ATGACCTGGA TGCAGTGGGA CCGCGAGGTG ATCAACTACA CCAACATCAT CTACGACCTG 
1921 ATCGAGGAGA GCCAGAACCA GCAGGAGAAG AACGAGCAGG ACCTGCTGGC CCTGGACAAG 
1981 TGGGCCAGCC TGTGGAGCTG GTTCGACATC AGCAACTGGC TGTGGTACAT CAAGATCTTC 
2041 ATCATCATCG TGGGCGGCCT GATCGGCCTG CGCATCGTGT TCGCCGTGCT GAGCATCATC 
2101 AACCGCGCCG GCCAGGGCTA CAGCCCCCTG AGCCTGCAGA CCCTGACCCC CCACCCCGAG 
2161 GGCCCCGACC GCCCCGGCCG CATCAAGGAG GAGGGCGGCG AGCAGGACCG CGACCGCAGC 
2221 ATCCGCCTGG TGAGCGGCTT CCTGGCCCTG GCCTGGGACG ACCTGCGCAG CCTGTGCCTG 
2281 TTCAGCTACC GCCGCCTGCG CGACTTCATC AGCATCGCCG CCCGCACCGT GGAGCTGCTG 
2341 GGCCGCAGCA GCCTGAAGGG CCTGCGCCTG GGCTGGGAGG GCCTGAAGTA CCTGGGCAAC 
2401 CTGCTGGGCT ACCGCGGCCA GGAGCTGAAG AGCAGCGCCA TCAACCTGAT CGACACCATC 
2461 GCCATCGCCG TGGCCGGCTG GACCGACCGC GTGATCGAGA TCGGCCAGCG CTTCTGCCGC 
2521 GCCATCCGCA ACATCCCCCG CCGCATCCGC CAGGGCGCCG AGCGCGCCCT GCAGTAA 



FIGURE 12 

gpl60mod.SE8538 



1 ATGCGCGTGA AGGGCATCCA GCGCAACAGC CAGCACCTGC TGCGCTGGGG CACCATGATC 

61 CTGGGCATGA TCATCATCTG CAGCACCGCC GACAAGCTGT GGGTGACCGT GTACTACGGC 

121 GTGCCCGTGT GGAAGGACGC CGAGACCACC CTGTTCTGCG CCAGCGACGC CAAGGCCTAC 

181 GACACCGAGG TGCACAACGT GTGGGCCACC CACGCCTGCG TGCCCACCGA CCCCAACCCC 

241 CAGGAGCTGC ACCTGGCCAA CGTGACCGAG GAGTTCAACA TGTGGAAGAA CAGCATGGTG 

301 GAGCAGATGC ACACCGACAT CATCAGCCTG TGGGACCAGA GCCTGATCCC CTGCGTGAAG 

361 CTGACCCCCC TGTGCGTGAC CCTGGAGTGC AACGACTACA ACTACAACGT GACCAACAGC 

421 AGCCACAGCT ACAACGTGAC CAACATGCAG GAG AT GAAG A ACTGCAGCTT CAACGTGACC 

481 ACCGAGCTGC GCGACAAGCG CCAGAAGGTG ACCAGCCTGT TCTACAAGCT GGACGTGGTG 

541 CCCATCGGCG GCAACGACAC CAACAGCACC CAGTACCGCC TGATCAACTG CAACACCAGC 

601 GCCATCACCC AGGCCTGCCC CAAGGTGACC TTCGAGCCCA TCCCCATCCA CTACTGCGCC 

661 CCCGCCGGCT TCGCCATCCT GAAGTGCCGC GACGAGAACT TCAACGGCAC CGGCCCCTGC 

721 AAGAACGTGA GCACCGTGCA GTGCACCCAC GGCATCAAGC CCGTGGTGAG CACCCAGCTG 

781 CTGCTGAACG GCAGCCTGGC CCGCGAGAAG GTGATGATCC GCAGCGAGAA CATCACCAAC 

841 AACGTGAAGA ACATCATCGT GCAGCTGAAG GAGCCCGTGG AGATCAACTG CACCCGCCCC 

901 GGCAACAACA CCCGCAAGAG CATCCGCATC GGCCCCGGCC AGGCCTTCTA CGCCACCGGC 

961 GAGGTGATCG GCGACATCCG CCAGGCCCAC TGCAACGTGA GCCGCGCCAA GTGGAACAAG 

1021 ACCCTGCACG AGGTGGCCAA GCAGCTGCGC ACCTACTTCA ACAACAAGAC CATCATCTTC 

1081 ACCAACAGCA GCGGCGGCGA CCTGGAGATC ACCACCCACA CCGTGAACTG CGGCGGCGAG 

1141 TTCTTCTACT GCAACACCAG CGGCCTGTTC AACAGCACCT GGAGCAGCAA CGCCAGCGAG 

1201 CCCATGAGCA ACAGCACCGA GAGCAACGAC ACCATCACCC TGCAGTGCCG CATCCGCCAG 

1261 ATCATCAACA TGTGGCAGCG CGCCGGCAAG GCCATCTACG CCCCCCCCAT CCCCGGCATC 

1321 ATCAAGTGCG TGAGCAACAT CACCGGCCTG ATCCTGACCC GCGACGGCGG CAGCAACAAC 

1381 AGCACCAACG AGACCTTCCG CCCCGGCGGC GGCGACATGC GCGACAACTG GCGCAGCGAG 

1441 CTGTACAAGT ACAAGGTGGT GAAGATCGAG CCCCTGGGCG TGGCCCCCAC CAAGGCCAAG 

1501 CGCCGCGTGG TGGAGCGCGA GAAGCGCGCC ATCGGCATCG GCGCCGTGTT CATCGGCTTC 

1561 CTGGGCGCCG CCGGCAGCAC CATGGGCGCC GCCAGCATCA CCCTGACCGT GCAGGCCCGC 

1621 CAGCTGCTGA GCGGCATCGT GCAGCAGCAG AGCAACCTGC TGCGCGCCAT CGAGGCCCAG 

1681 CAGCACCTGC TGAAGCTGAC CGTGTGGGGC ATCAAGCAGC TGCAGGCCCG CGTGCTGGCC 

1741 GTGGAGCGCT ACCTGAAGGA CCAGCAGCTG CTGGGCATCT GGGGCTGCAG CGGCAAGCTG 

1801 ATCTGCACCA CCAACGTGCC CTGGAACAGC AGCTGGAGCA ACAAGAGCCA GAGCGAGATC 

18 61 TGGGACAACA TGACCTGGCT GCAGTGGGAC AAGGAGATCA GCAACTACAC CCAGACCATC 

1921 TACCGCCTGA TCGAGGAGAG CCAGAACCAG CAGGAGAAGA ACGAGCAGGA CCTGCTGGCC 

1981 CTGGACAAGT GGGCCAGCCT GTGGAACTGG TTCGACATCA GCCGCTGGCT GTGGTACATC 

2041 CGCATCTTCA TCATGATCGT GGGCGGCCTG ATCGGCCTGC GCATCGTGTT CGCCGTGCTG 

2101 AGCGTGATCA ACCGCGTGCG CCAGGGCTAC AGCCCCCTGA GCTTCCAGAT CCACACCCCC 

2161 AACCCCGGCG ACCTGGACCG CCCCGGCCGC ATCGAGGAGG AGGGCGGCGA GCAGGACCGC 

2221 GGCCGCAGCA TCCGCCTGGT GAGCGGCTTC CTGGCCCTGG CCTGGGACGA CCTGCGCAGC 

2281 CTGTGCCTGT TCAGCTACCA CCGCCTGCGC GACTTCATCC TGATCGCCGC CCGCACCGTG 

2341 GAGCTGCTGG GCCAGCGCGG CTGGGAGGGC CTGAAGTACC TGTGGAACCT GCTGGTGTAC 

2401 TGGATCCGCG AGCTGAAGAT CAGCGCCATC AGCCTGCTGG ACACCATCGC CATCGCCGTG 

24 61 GCCGGCTGGA CCGACCGCGT GATCGAGCTG GGCCAGCGCC TGTGCCGCGC CATCCTGCAC 
2521 ATCCCCGTGC GCATCCGCCA GGGCTTCGAG CGCGCCCTGC TGTAA 



FIGURE 13 

gpl60mod.UG031 



1 ATGCGCGTGC GCGGCATCCA GACCAGCTGG CAGAACCTGT GGCGCTGGGG CACCATGATC 
61 CTGGGCATGC TGATGATCTA CAGCGCCGCC GAGAACCTGT GGGTGACCGT GTACTACGGC 
121 GTGCCCGTGT GGAAGGACGC CGAGACCACC CTGTTCTGCG CCAGCGACGC CAAGGCCTAC 
181 GACACCGAGG TGCACAACGT GTGGGCCACC CACGCCTGCG TGCCCACCGA CCCCAACCCC 
241 CAGGAGATCC ACCTGGAGAA CGTGACCGAG GACTTCAACA TGTGGAAGAA CAACATGGTG 
301 GAGCAGATGC ACACCGACAT CATCAGCCTG TGGGACCAGA GCCTGAAGCC CTGCGTGGAG 
361 CTGACCCCCC TGTGCGTGAC CCTGGACTGC CTGAACGCCA CCCTGAACGC CACCGCCCCC 
421 AACGTGACCA ACGACATGGA GGGCGAGATG AAGAACTGCA GCTACAACAT CACCACCGAG 
481 CTGAAGGACA AGAAGCAGCA GGTGTACAGC CTGTTCTACA AGCTGGACGT GGTGCAGATC 
541 AACGAGAAGA ACAAGACCAA CAAGTACCGC CTGATCAACT GCAACACCAG CGCCATCACC 
601 CAGGCCTGCC CCAAGGTGAG CTTCGAGCCC ATCCCCATCC ACTACTGCGC CCCCGCCGGC 
661 TTCGCCATCC TGAAGTGCAA GGACACCGAG TTCAACGGCA CCGGCCCCTG CAAGAACGTG 
721 AGCACCGTGC AGTGCACCCA CGGCATCCGC CCCGTGATCA GCACCCAGCT GCTGCTGAAC 
781 GGCAGCCTGG CCGAGGGCGG CATCCAGATC CGCAGCGAGA ACATCACCAA CAACGCCAAG 
841 ACCATCATCG TGCAGCTGGA CAAGGCCGTG AAGATCAACT GCACCCGCCC CAACAACAAC 
901 ACCCGCAAGA GCGTGCGCAT CGGCCCCGGC CAGGCCTTCT ACGCCACCGG CGACATCATC 
961 GGCGACATCC GCCAGGCCCA CTGCAACGTG AGCCGCGCCA AGTGGAACGA GACCCTGCGC 
1021 GGCATCGCCA AGAAGCTGAG CGAGCACTTC AAGAACAAGA TCATCATCTT CGAGAAGAGC 
1081 AGCGGCGGCG ACATCGAGAT CACCACCCAC AGCTTCAACT GCGGCGGCGA GTTCTTCTAC 
1141 TGCAACACCA GCGGCCTGTT CAACGGCACC TGGAAGCCCA ACAGCACCGA GAGCAACAAC 
1201 ACCACCCCCA ACGACACCAT CACCCTGACC TGCCGCATCA AGCAGATCAT CAACATGTGG 
1261 CAGAAGGTGG GCCAGGCCAT GTACGCCCCC CCCATCCAGG GCGTGATCCG CTGCGAGAGC 
1321 AACATCACCG GCCTGCTGCT GACCCGCGAC GGCGGCATCA ACAGCATCAA CGAGACCTTC 
1381 CGCCCCGGCG GCGGCAACAT GCGCGACAAC TGGCGCAGCG AGCTGTACAA GTACAAGGTG 
1441 GTGAAGATCG AGCCCCTGGG CGTGGCCCCC AGCCGCGCCA AGCGCCGCGT GGTGGAGCGC 
1501 GAGAAGCGCG CCGTGGGCAT CGGCGCCGTG TTCCTGGGCT TCCTGGGCGC CGCCGGCAGC 
1561 ACCATGGGCG CCGCCAGCAT CACCCTGACC GCCCAGGCCC GCCAGCTGCT GAGCGGCATC 
1621 GTGCAGCAGC AGAGCAACCT GCTGCGCGCC ATCAAGGCCC AGCAGCACAT GCTGAAGCTG 
1681 ACCGTGTGGG GCATCAAGCA GCTGCAGGCC CGCGTGCTGG CCGTGGAGCG CTACCTGAAG 
1741 GACCAGCAGC TGCTGGGCAT CTGGGGCTGC AGCGGCAAGC TGATCTGCAC CACCAACGTG 
1801 CCCTGGAACA GCAGCTGGAG CAACAAGAGC ATGAACGAGA TCTGGGACAA CATGACCTGG 
1861 CTGCAGTGGG AGAAGGAGAT CAGCAACTAC ACCCAGCTGA TCTACAACCT GATCGAGGAG 
1921 AGCCAGAACC AGCAGGAGAA GAACGAGCAG GACCTGCTGG CCCTGGACAA GTGGGCCAGC 
1981 CTGTGGAACT GGTTCGACAT CAGCCGCTGG CTGTGGTACA TCAAGATCTT CATCATGATC 
2041 GTGGGCGGCC TGATCGGCCT GCGCATCGTG TTCGCCGTGC TGAGCGTGAT CAACCGCGTG 
2101 CGCCAGGGCT ACAGCCCCCT GAGCTTCCAG ATCCGCACCC CCAACCCCGA GGAGCCCGAC 
2161 CGCCTGGGCC GCATCGGCGA GGAGGGCGGC GAGCAGGACC GCGACCGCAG CATCCGCCTG 
2221 GTGAGCGGCT TCCTGGCCCT GGCCTGGGAC GACCTGCGCA GCCTGTGCCT GTTCAGCTAC 
2281 CACCGCCTGC GCGACTTCAT CAGCATCGCC GCCCGCACCG TGGAGCTGCT GGGCCACAGC 
2341 AGCCTGAAGG GCCTGCGCCT GGGCTGGGAG GGCCTGAAGT ACCTGTGGAA CCTGCTGCTG 
2401 TACTGGGGCC TGGAGCTGAA GACCAGCGCC GTGAACCTGG TGGACACCAT CGCCATCGCC 
24 61 GTGGCCGGCT GGACCGACCG CGTGATCGAG ATCGGCCAGC GCATCTTCCG CGCCATCCTG 
2521 AACATCCCCC GCCGCATCCG CCAGGGCCTG GAGCGCGGCC TGCTGTAA 



FIGURE 14 

gpl60mod.92UG001 



1 ATGCGCGTGC GCGAGATCGA GCGCAACTAC CTGTGCCTGT GGCGCTGGGG CATCATGCTG 

61 CTGGGCATGC TGATGACCTA CAGCGTGGCC GAGAAGAAGT GGGTGACCGT GTACTACGGC 

121 GTGCCCGTGT GGAAGGAGGC CACCACCACC CTGTTCTGCG CCAGCGACGC CAAGAGCTAC 

181 AAGACCGAGG TGCACAACAT CTGGGCCACC CACGCCTGCG TGCCCACCGA CCCCAACCCC 

241 CGCGAGATCG AGCTGGAGAA CGTGACCGAG AACTTCAACA TGTGGAAGAA CAACATGGTG 

301 GAGCAGATGC ACGAGGACAT CATCAGCCTG TGGGACCAGA GCCTGAAGCC CTGCGTGAAG 

361 CTGACCCCCC TGTGCGTGAC CCTGAACTGC ACCGACGCCC GCCGCAACGA GACCCGCAAC 

421 AACATCACCG GCATGGAGAA CAACGACCAG ATCGAGATGA AGAACTGCAG CTTCAACATC 

481 ACCACCAAGC TGATCGACAA GAAGAAGCAG GTGCACGCCC TGTTCTACCG CCTGGACGTG 

541 GTGCAGATCG ACAACGACAC CAGCAACAGC AACTACAGCA ACTACCGCCT GATCAACTGC 

601 AACACCAGCG CCATCACCCA GGCCTGCCCC AAGGTGACCT TCGAGCCCAT CCCCATCCAC 

661 TACTGCGCCC CCGCCGGCTT CGCCATCCTG AAGTGCCGCG ACAAGAAGTT CAACGGCACC 

721 GGCCCCTGCA AGAACGTGAG CACCGTGCAG TGCACCCACG GCATCCGCCC CGTGGTGAGC 

781 ACCCAGCTGC TGCTGAACGG CAGCCTGGCC GAGGAGGAGA TCATCATCCG CAGCGAGAAC 

841 CTGACCAACA ACGCCAAGAC CCTGATCGTG CAGCTGAACG AGAGCGTGGA GATCAACTGC 

901 ACCCGCCCCT ACTACAACCA GATCCGCCAG CGCACCAGCA TCGGCCAGGG CCAGGCCCTG 

961 TACACCACCC GCGTGACCGG CGACATCCGC AAGGCCTACT GCAACATCAG CAAGGCCGGC 

1021 TGGAACAAGA CCCTGCAGCA GGTGGCCAAG AAGCTGGGCG ACCTGTTCAA CCAGACCACC 

1081 ATCATCTTCA AGCCCAGCAG CGGCGGCGAC CCCGAGATCA CCACCCACAG CTTCAACTGC 

1141 GGCGGCGAGT TCTTCTACTG CAACACCAGC AAGCTGTTCA ACAGCGCCTG GAACGACAGC 

1201 ACCTGGAACA TCGGCAACAA CAACACCGGC AGCGACAACG AGACCATCAT CATCCCCTGC 

1261 CGCATCAAGC AGATCATCAA CATGTGGCAG GGCGTGGGCA AGGCCATGTA CGCCCCCCCC 

1321 ATCGAGGGCT GGATCAACTG CGCCAGCAAC ATCACCGGCC TGCTGCTGGT GCGCGACGGC 

1381 GGCGGCGCCA ACGACAGCCA GAACGAGACC TTCCGCCCCC AGGGCGGCGA CATGCGCGAC 

1441 AACTGGCGCA GCGAGCTGTA CAAGTACAAG GTGGTGAAGA TCGAGCCCCT GGGCATCGCC 

1501 CCCACCAAGG CCAAGCGCCG CGTGGTGGAG CGCGAGAAGC GCGCCATCGG CCTGGGCGCC 

1561 ATGTTCCTGG GCTTCCTGGG CGCCGCCGGC AGCACCATGG GCGCCGCCAG CCTGACCCTG 

1621 ACCGTGCAGG CCCGCCAGCT GCTGAGCGGC ATCGTGCAGC ACCAGAACAA CCTGCTGATG 

1681 GCCATCGAGG CCCAGCAGCA CCTGCTGCAG CTGACCGTGT GGGGCATCAA GCAGCTGCAG 

1741 GCCCGCATCC TGGCCGTGGA GCGCTACCTG CAGGACCAGC AGCTGCTGGG CAGCTGGGGC 

1801 TGCAGCGGCC GCCACATCTG CACCACCACC GTGCCCTGGA ACAGCAGCTG GAGCAACAAG 

1861 AGCATCGACG ACATCTGGAA CAACATGACC TGGATGGAGT GGGAGAAGGA GATCGACAAC 

1921 TACACCGGCG TGATCTACCG CCTGATCGAG GAGAGCCAGA CCCAGCAGGA GAAGAACGAG 

1981 CAGGAGCTGC TGCAGCTGGA CAAGTGGGCC AGCCTGTGGA ACTGGTTCAG CATCACCAAG 

2041 TGGCTGTGGT ACATCAAGAT CTTCATCATG ATCGTGGGCG GCCTGATCGG CCTGCGCATC 

2101 GTGTTCACCG TGCTGAGCCT GGTGAACCGC GTGCGCCAGG GCTACAGCCC CCTGAGCTTC 

2161 CAGACCCTGT TCCCCGCCCC CCGCGGCCCC GACCGCCCCG AGGAGATCGA GGAGGGCGGC 

2221 GGCGAGCAGG GCCGCGGCCG CAGCACCCGC CTGGTGAACG GCTTCAGCAC CCTGATCTGG 

2281 GACGACCTGC GCAACCTGTG CCTGTTCAGC TACCACCGCC TGCGCGACCT GATCCTGATC 

2341 GCCACCCGCA TCGTGGAGCT GCTGGGCCGC CGCGGCTGGG AGGCCATCAA GTACCTGTGG 

2401 AACCTGCTGC AGTACTGGAG CCAGGAGCTG AAGACCAGCG CCATCAGCCT GTTCAACGCC 

24 61 ACCGCCGTGG CCGTGGCCGA GGGCACCGAC CGCGTGATCG AGGTGGTGCA GCGCTTCTTC 

2521 CGCGGCATCC TGAACGTGCC CACCCGCATC CGCCAGGGCC TGGAGCGCGC CCTGCTGTAA 



FIGURE 15 

gpl60mod.94UG114 



1 ATGCGCGTGC GCGAGACCAA GCGCAACTAC CAGCACCTGT GGAAGTGGGG CACCATGCTG 
61 CTGGGCATGC TGATGATCTG CAGCGTGACC GGCAAGAGCT GGGTGACCGT GTACTACGGC 
121 GTGCCCGTGT GGAAGGAGGC CACCACCACC CTGTTCTGCG CCAGCGACGC CAAGGCCTAC 
181 AAGGCCGAGG CCCACAACAT CTGGGCCACC CACGCCTGCG TGCCCACCGA CCCCAACCCC 
241 CAGGAGATCA AGCTGGAGAA CGTGACCGAG AACTTCAACA TGTGGAAGAA CAACATGGTG 
301 GAGCAGATGC ACGAGGACAT CATCAGCCTG TGGGACCAGA GCCTGAAGCC CTGCGTGAAG 
361 CTGACCCCCC TGTGCGTGAC CCTGAACTGC ACCAACTGGG TGACCGACAC CACCAACACC 
421 ACCGGCATGG CCAACTGCAG CTTCAACATC ACCACCGAGA TCCGCGACAA GAAGAAGCAG 
481 GTGCAGGCCC TGTTCTACAA GCTGGACGTG GTGAAGATCA ACGACAACGA CAGCGACAAC 
541 ACCAGCTACC GCCTGATCAA CTGCAACACC AGCGCCATCA CCCAGGCCTG CCCCAAGATG 
601 ACCTTCGAGC CCATCCCCAT CCACTACTGC GCCCCCGCCG GCTTCGCCAT CCTGAAGTGC 
661 AACGAGAAGA AGTTCAACGG CACCGGCCCC TGCAAGAACG TGAGCACCGT GCAGTGCACC 
721 CACGGCATCA AGCCCGTGGT GAGCACCCAG CTGCTGCTGA ACGGCAGCCT GGCCGAGGAG 
781 GAGATCATCA TCCGCAGCGA GAACCTGACC AACAACGCCA AGATCATCAT CGTGCAGCTG 
841 AACGAGAGCG TGCCCATCAA CTGCATCCGC CCCTACAACA ACACCCGCCA GAGCACCCGC 
901 ATCGGCCCCG GCCAGGCCCT GTTCACCACC AAGGTGATCG GCGACATCCG CCAGGCCCAC 
961 TGCAACATCA GCGGCGCCGG CTGGAACAAG ACCCTGCAGC AGGTGGCCGA GAAGCTGGGC 
1021 AACCTGCTGA ACCAGACCAC CATCATCTTC AAGCCCAGCA GCGGCGGCGA CCCCGAGATC 
1081 ACCACCCACA GCTTCAACTG CGGCGGCGAG TTCTTCTACT GCAACACCAC CCGCCTGTTC 
1141 AACAGCACCT GGAAGCGCAA CAACAGCGAG TGGCGCAGCG ACAACACCCC CGACGAGACC 
1201 ATCACCCTGC AGTGCCGCAT CAAGCAGATC ATCAACATGT GGCAGGAGGT GGGCAAGGCC 
1261 ATGTACGCCC CCCCCATCGA GGGCTTCATC AACTGCAGCA GCAACATCAC CGGCCTGCTG 
1321 CTGACCCGCG ACGGCGGCGC CATCAACAGC AGCCAGAACG AGACCTTCCG CCCCGGCGGC 
1381 GGCGACATGC GCAACAACTG GCGCAGCGAG CTGTACAAGT ACAAGGTGGT GAAGCTGGAG 
1441 CCCATCGGCC TGGCCCCCAC CGCCGCCAAG CGCCGCGTGG TGGAGCGCGA GAAGCGCGCC 
1501 ATCGGCCTGG GCGCCCTGTT CCTGGGCTTC CTGGGCACCG CCGGCAGCAC CATGGGCGCC 
1561 GTGAGCCTGA CCCTGACCGT GCAGGCCCGC CAGGTGCTGA GCGGCATCGT GCAGCAGCAG 
1621 AACAACCTGC TGCGCGCCAT CGAGGCCCAG CAGCACCTGC TGCAGCTGAC CGTGTGGGGC 
1681 ATCAAGCAGC TGCAGGCCCG CATCCTGGCC GTGGAGAGCT ACCTGAAGGA CCAGCAGCTG 
1741 CTGGGCATCT GGGGCTGCAG CGGCAAGCAC ATCTGCACCA CCAACGTGCC CTGGAACAGC 
1801 AGCTGGAGCA ACCGCAGCGT GGACGAGATC TGGAACAACA TGACCTGGAT GGAGTGGGAG 
1861 CGCGAGATCG ACAACTACAC CGAGCTGGTG TACAGCCTGC TGGAGGTGAG CCAGATCCAG 
1921 CAGGAGAAGA ACGAGCAGGA GCTGCTGAAG CTGGACACCT GGGCCAGCCT GTGGAACTGG 
1981 TTCAGCATCA CCCAGTGGCT GTGGTACATC AAGATCTTCA TCATGATCGT GGGCGGCCTG 
2041 ATCGGCCTGC GCATCGTGTT CGCCGTGCTG AGCGTGGTGA ACCGCGTGCG CCAGGGCTAC 
2101 AGCCCCCTGA GCTTCCAGAC CCTGCTGCCC GCCCCCCGCG AGCCCGACCG CCCCGAGGGC 
2161 ATCGAGGAGG AGGGCGGCGA GCGCGACCGC GGCCGCAGCA TCCGCCTGGT GAACGGCCTG 
2221 AGCGCCCTGA TCTGGGACGA CCTGCGCAAC CTGTGCCTGT TCAGCTACCA CCGCCTGCGC 
2281 GACCTGATCC TGATCGCCGC CCGCATCGTG GAGCTGCTGG GCCGCCGCGG CTGGGAGGCC 
2341 ATCAAGTACC TGTGGAACCT GCTGCAGTAC TGGATCCAGG AGCTGAAGAA CAGCGCCGTG 
2401 AGCCTGTTCA ACACCATCGC CATCGCCGTG GCCGAGGGCA CCGACCGCGC CATCGAGCTG 
24 61 GTGCAGCGCG CCGTGCGCGC CATCCTGAAC ATCCCCGTGC GCATCCGCCA GGGCCTGGAG 
2521 CGCGCCCTGC TGTAA 



FIGURE 16 

gpl60mod.ELI 



1 ATGCGCGCCC GCGGCATCGA GCGCAACTGC CAGAACTGGT GGAAGTGGGG CATCATGCTG 
61 CTGGGCATCC TGATGACCTG CAGCGCCGCC GACAACCTGT GGGTGACCGT GTACTACGGC 
121 GTGCCCGTGT GGAAGGAGGC CACCACCACC CTGTTCTGCG CCAGCGACGC CAAGAGCTAC 
181 GAGACCGAGG CCCACAACAT CTGGGCCACC CACGCCTGCG TGCCCACCGA CCCCAACCCC 
241 CAGGAGATCG CCCTGGAGAA CGTGACCGAG AACTTCAACA TGTGGAAGAA CAACATGGTG 
301 GAGCAGATGC ACGAGGACAT CATCAGCCTG TGGGACCAGA GCCTGAAGCC CTGCGTGAAG 
361 CTGACCCCCC TGTGCGTGAC CCTGAACTGC AGCGACGAGC TGCGCAACAA CGGCACCATG 
421 GGCAACAACG TGACCACCGA GGAGAAGGGC ATGAAGAACT GCAGCTTCAA CGTGACCACC 
481 GTGCTGAAGG ACAAGAAGCA GCAGGTGTAC GCCCTGTTCT ACCGCCTGGA CATCGTGCCC 
541 ATCGACAACG ACAGCAGCAC CAACAGCACC AACTACCGCC TGATCAACTG CAACACCAGC 
601 GCCATCACCC AGGCCTGCCC CAAGGTGAGC TTCGAGCCCA TCCCCATCCA CTACTGCGCC 
661 CCCGCCGGCT TCGCCATCCT GAAGTGCCGC GACAAGAAGT TCAACGGCAC CGGCCCCTGC 
721 ACCAACGTGA GCACCGTGCA GTGCACCCAC GGCATCCGCC CCGTGGTGAG CACCCAGCTG 
781 CTGCTGAACG GCAGCCTGGC CGAGGAGGAG GTGATCATCC GCAGCGAGAA CCTGACCAAC 
841 AACGCCAAGA ACATCATCGC CCACCTGAAC GAGAGCGTGA AGATCACCTG CGCCCGCCCC 
901 TACCAGAACA CCCGCCAGCG CACCCCCATC GGCCTGGGCC AGAGCCTGTA CACCACCCGC 
961 AGCCGCAGCA TCATCGGCCA GGCCCACTGC AACATCAGCC GCGCCCAGTG GAGCAAGACC 
1021 CTGCAGCAGG TGGCCCGCAA GCTGGGCACC CTGCTGAACA AGACCATCAT CAAGTTCAAG 
1081 CCCAGCAGCG GCGGCGACCC CGAGATCACC ACCCACAGCT TCAACTGCGG CGGCGAGTTC 
1141 TTCTACTGCA ACACCAGCGG CCTGTTCAAC AGCACCTGGA ACATCAGCGC CTGGAACAAC 
1201 AT CACCGAGA GCAACAACAG CACCAACACC AACATCACCC TGCAGTGCCG CATCAAGCAG 
1261 ATCATCAAGA TGGTGGCCGG CCGCAAGGCC ATCTACGCCC CCCCCATCGA GCGCAACATC 
1321 CTGTGCAGCA GCAACATCAC CGGCCTGCTG CTGACCCGCG ACGGCGGCAT CAACAACAGC 
1381 ACCAACGAGA CCTTCCGCCC CGGCGGCGGC GACATGCGCG ACAACTGGCG CAGCGAGCTG 
1441 TACAAGTACA AGGTGGTGCA GATCGAGCCC CTGGGCGTGG CCCCCACCCG CGCCAAGCGC 
1501 CGCGTGGTGG AGCGCGAGAA GCGCGCCATC GGCCTGGGCG CCATGTTCCT GGGCTTCCTG 
1561 GGCGCCGCCG GCAGCACCAT GGGCGCCCGC AGCGTGACCC TGACCGTGCA GGCCCGCCAG 
1621 CTGATGAGCG GCATCGTGCA GCAGCAGAAC AACCTGCTGC GCGCCATCGA GGCCCAGCAG 
1681 CACCTGCTGC AGCTGACCGT GTGGGGCATC AAGCAGCTGC AGGCCCGCAT CCTGGCCGTG 
1741 GAGCGCTACC TGAAGGACCA GCAGCTGCTG GGCATCTGGG GCTGCAGCGG CAAGCACATC 
1801 TGCACCACCA ACGTGCCCTG GAACAGCAGC TGGAGCAACC GCAGCCTGAA CGAGATCTGG 
1861 CAGAACATGA CCTGGATGGA GTGGGAGCGC GAGATCGACA ACTACACCGG CCTGATCTAC 
1921 AGCCTGATCG AGGAGAGCCA GACCCAGCAG GAGAAGAACG AGAAGGAGCT GCTGGAGCTG 
1981 GACAAGTGGG CCAGCCTGTG GAACTGGTTC AGCATCACCC AGTGGCTGTG GTACATCAAG 
2041 ATCTTCATCA TGATCATCGG CGGCCTGATC GGCCTGCGCA TCGTGTTCGC CGTGCTGAGC 
2101 CTGGTGAACC GCGTGCGCCA GGGCTACAGC CCCCTGAGCT TCCAGACCCT GCTGCCCGCC 
2161 CCCCGCGGCC CCGACCGCCC CGAGGGCACC GAGGAGGAGG GCGGCGAGCG CGGCCGCGAC 
2221 CGCAGCGTGC GCCTGCTGAA CGGCTTCAGC GCCCTGATCT GGGACGACCT GCGCAGCCTG 
2281 TGCCTGTTCA GCTACCACCG CCTGCGCGAC CTGATCCTGA TCGCCGTGCG CATCGTGGAG 
2341 CTGCTGGGCC GCCGCGGCTG GGACATCCTG AAGTACCTGT GGAACCTGCT GCAGTACTGG 
2401 AGCCAGGAGC TGCGCAACAG CGCCAGCAGC CTGTTCGACG CCATCGCCAT CGCCGTGGCC 
24 61 GAGGGCACCG ACCGCGTGAT CGAGATCATC CAGCGCGCCT GCCGCGCCGT GCTGAACATC 
2521 CCCCGCCGCA TCCGCCAGGG CCTGGAGCGC AGCCTGCTGT AA 



FIGURE 17 

gpl60mod. 93IN101 



1 ATGCGCGTGC GCGGCACCCT GCGCAACTAC CAGCAGTGGT GGATCTGGGG CGTGCTGGGC 
61 TTCTGGATGC TGATGATCTG CAACGGCGGC GGCAACCTGT GGGTGACCGT GTACTACGGC 
121 GTGCCCGTGT GGAAGGAGGC CAAGACCACC CTGCTGTGCG CCAGCGACGC CAAGGCCTAC 
181 GAGCGCGAGG TGCACAACGT GTGGGCCACC CACGCCTGCG TGCCCACCGA CCCCAACCCC 
241 CAGGAGATCG TGCTGGGCAA CGTGACCGAG AACTTCAACA TGTGGAAGAA CGACATGGTG 
301 GACCAGATGC ACGAGGACGT GATCAGCCTG TGGGACCAGA GCCTGAAGCC CTGCGTGAAG 
361 CTGACCCCCC TGTGCGTGAC CCTGGAGTGC CGCAACGTGA GCCGCAACGT GAGCAGCTAC 
421 AACACCTACA ACGGCAGCGT GGAGGAGATC AAGAACTGCA GCTTCAACGC CACCCCCGAG 
481 GTGCGCGACC GCAAGCAGCG CATGTACGCC CTGTTCTACG GCCTGGACAT CGTGCCCCTG 
541 AACAAGAAGA ACAGCAGCGA GAACAGCAGC GAGTACCGCC TGATCAACTG CAACACCAGC 
601 GCCATCACCC AGGCCTGCCC CAAGGTGACC TTCGACCCCA TCCCCATCCA CTACTGCGCC 
661 CCCGCCGGCT ACGCCATCCT GAAGTGCAAC AACAAGACCT TCAACGGCAC CGGCCCCTGC 
721 AACAACGTGA GCACCGTGCA GTGCACCCAC GGCATCAAGC CCGTGGTGAG CACCCAGCTG 
781 CTGCTGAACG GCAGCCTGGC CGAGGGCGAG ATCATCATCC GCAGCGAGAA CCTGACCAAC 
841 AACGTGAAGA CCATCATCGT GCACCTGAAC CAGAGCGTGG AGATCGTGTG CACCCGCCCC 
901 AACAACAACA CCCGCAAGAG CATCCGCATC GGCCCCGGCC AGACCTTCTA CGCCACCGGC 
961 GACATCATCG GCGACATCCG CCAGGCCCAC TGCAACATCA GCCGCGACAA GTGGAACGAG 
1021 ACCCTGCAGC GCGTGGGCAA GAAGCTGGCC GAGCACTTCC ACAACAAGAC CATCAAGTTC 
1081 GCCAGCAGCA GCGGCGGCGA CCTGGAGATC ACCACCCACA GCTTCAACTG CCGCGGCGAG 
1141 TTCTTCTACT GCAACACCAG CGGCCTGTTC AACGGCACCT ACATGCCCAC CTACATGCCC 
1201 AACGGCACCG AGAGCAACAG CAACAGCACC ATCACCATCC CCTGCCGCAT CAAGCAGATC 
12 61 ATCAACATGT GGCAGGAGGT GGGCCGCGCC ATGTACGCCC CCCCCATCGC CGGCAACATC 
1321 ACCTGCACCA GCAACATCAC CGGCCTGCTG CTGGTGCACG ACGGCGGCAT CAAGGAGAAC 
1381 GACACCGAGA ACAAGACCGA GATCTTCCGC CCCGGCGGCG GCGACATGCG CGACAACTGG 
1441 CGCAGCGAGC TGTACAAGTA CAAGGTGGTG GAGATCAAGC CCCTGGGCGT GGCCCCCACC 
1501 GCCGCCAAGC GCCGCGTGGT GGAGCGCGAG AAGCGCGCCG TGGGCATCGG CGCCGTGTTC 
1561 CTGGGCTTCC TGGGCGCCGC CGGCAGCACC ATGGGCGCCG CCAGCATCAC CCTGACCGCC 
1621 CAGGCCCGCC AGCTGCTGAG CGGCATCGTG CAGCAGCAGA GCAACCTGCT GCGCGCCATC 
1681 GAGGCCCAGC AGCACCTGCT GCAGCTGACC GTGTGGGGCA TCAAGCAGCT GCAGACCCGC 
1741 GTGCTGGCCA TCGAGCGCTA CCTGAAGGAC CAGCAGCTGC TGGGCATCTG GGGCTGCAGC 
1801 GGCAAGCTGA TCTGCACCAC CGCCGTGCCC TGGAACAGCA GCTGGAGCAA CAAGACCCAG 
1861 AGCGAGATCT GGAACAACAT GACCTGGATG CAGTGGGACC GCGAGGTGAG CAACTACACC 
1921 AACATCATCT ACAGCCTGCT GGAGGAGAGC CAGAACCAGC AGGAGAAGAA CGAGAAGGAC 
1981 CTGCTGGCCC TGGACAGCTG GAAGAACCTG TGGAGCTGGT TCGACATCAC CAACTGGCTG 
2041 TGGTACATCA AGATCTTCAT CATGATCGTG GGCGGCCTGA TCGGCCTGCG CATCATCTTC 
2101 GCCGTGCTGA GCATCGTGAA CCGCGTGCGC CAGGGCTACA GCCCCCTGAG CTTCCAGACC 
2161 CTGACCCCCA ACCCCCGCGG CCCCGACCGC CTGGGCCGCA TCGAGGAGGA GGGCGGCGAG 
2221 CAGGACAAGG ACCGCAGCAT CCGCCTGGTG AACGGCTTCC TGGCCCTGGC CTGGGACGAC 
2281 CTGCGCAACC TGTGCCTGTT CAGCTACCAC CGCCTGCGCG ACTTCATCAG CGTGGCCGCC 
2341 CGCGTGGTGG AGCTGCTGGG CCGCAGCAGC TGGGAGGCCC TGAAGTACCT GGGCAGCCTG 
24 01 GTGCAGTACT GGGGCCTGGA GCTGAAGAAG AGCGCCATCA GCCTGTTCGA CAGCATCGCC 
24 61 ATCGTGGTGG CCGAGGGCAC CGACCGCATC ATCGAGCTGG TGCAGGGCTT CTGCCGCGCC 
2521 ATCCGCAACA TCCCCACCCG CATCCGCCAG GGCTTCGAGG CCGCCCTGCA GTAA 
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1 ATGGATGCAA TGAAGAGAGG GCTCTGCTGT GTGCTGCTGC TGTGTGGAGC AGTCTTCGTT 
61 TCGCCCAGCG CTAGCAACAA CCTGTGGGTG ACCGTGTACT ACGGCGTGCC CGTGTGGCGC 
121 GACGCCGACA CCACCCTGTT CTGCGCCAGC GACGCCAAGG CCCACGAGAC CGAGGTGCAC 
181 AACGTGTGGG CCACCCACGC CTGCGTGCCC ACCGACCCCA ACCCCCAGGA GATCCACCTG 
241 GAGAACGTGA CCGAGAACTT CAACATGTGG AAGAACAACA TGGTGGAGCA GATGCAGGAG 
301 GACGTGATCA GCCTGTGGGA CCAGAGCCTG AAGCCCTGCG TGAAGCTGAC CCCCCTGTGC 
361 GTGACCCTGA ACTGCACCAA CGCCAAGCTG ACCAACGTGA ACAACATCAC CAGCGTGAGC 
421 AACACCATCG GCAACATCAC CGACGAGGTG CGCAACTGCA GCTTCAACAT GACCACCGAG 
481 CTGCGCGACA AGAAGCAGAA GGTGCACGCC CTGTTCTACA AGCTGGACAT CGTGCCCATC 
541 GAGGACAACA AGACCAGCAG CGAGTACCGC CTGATCAACT GCAACACCAG CGTGATCAAG 
601 CAGGCCTGCC CCAAGATCAG CTTCGACCCC ATCCCCATCC ACTACTGCAC CCCCGCCGGC 
661 TACGCCATCC TGAAGTGCAA CGACAAGAAC TTCAACGGCA CCGGCCCCTG CAAGAACGTG 
721 AGCAGCGTGC AGTGCACCCA CGGCATCAAG CCCGTGGTGA GCACCCAGCT GCTGCTGAAC 
781 GGCAGCCTGG CCGAGGAGGA GATCATCATC CGCAGCGAGA ACCTGACCAA CAACGCCAAG 
841 ACCATCATCG TGCACCTGAA CAAGAGCGTG GAGATCAACT GCACCCGCCC CAGCAACAAC 
901 ACCCGCACCA GCATCACCAT CGGCCCCGGC CAGGTGTTCT ACCGCACCGG CGACATCATC 
961 GGCGACATCC GCAAGGCCTA CTGCGAGATC AACGGCACCA AGTGGAACGA GGTGCTGACC 
1021 CAGGTGACCG AGAAGCTGAA GGAGCACTTC AACAACAAGA CCATCATCTT CCAGCCCCCC 
1081 AGCGGCGGCG ACCTGGAGAT CACCATGCAC CACTTCAACT GCCGCGGCGA GTTCTTCTAC 
1141 TGCAACACCA CCCGCCTGTT CAACAACACC TGCATCGAGA ACGGCACCAT GGGCGGCTGC 
1201 AACGGCACCA TCATCCTGCC CTGCAAGATC AAGCAGATCA TCAACATGTG GCAGGGCGCC 
1261 GGCCAGGCCA TGTACGCCCC CCCCATCAGC GGCCGCATCA ACTGCGTGAG CAACATCACC 
1321 GGCATCCTGC TGACCCGCGA CGGCGGCGCC ATCAACACCA CCAACGAGAC CTTCCGCCCC 
1381 GGCGGCGGCA ACATCAAGGA CAACTGGCGC AGCGAGCTGT ACAAGTACAA GGTGGTGCAG 
1441 ATCGAGCCCC TGGGCATCGC CCCCACCCGC GCCAAGCGCC GCGTGGTGGA GCGCGAGAAG 
1501 CGCGCCGTGG GCATCGGCGC CATGATCTTC GGCTTCCTGG GCGCCGCCGG CAGCACCATG 
1561 GGCGCCGCCA GCATCACCCT GACCGTGCAG GCCCGCCAGC TGCTGAGCGG CATCGTGCAG 
1621 CAGCAGAGCA ACCTGCTGCG CGCCATCGAG GCCCAGCAGC ACCTGCTGCA GCTGACCGTG 
1681 TGGGGCATCA AGCAGCTGCA GGCCCGCGTG CTGGCCGTGG AGCGCTACCT GAAGGACCAG 
1741 AAGTTCCTGG GCCTGTGGGG CTGCAGCGGC AAGATCATCT GCACCACCGC CGTGCCCTGG 
1801 AACAGCACCT GGAGCAACCG CAGCTACGAG GAGATCTGGA ACAACATGAC CTGGATCGAG 
1861 TGGGAGCGCG AGATCAGCAA CTACACCAAC CAGATCTACG AGATCCTGAC CGAGAGCCAG 
1921 AACCAGCAGG ACCGCAACGA GAAGGACCTG CTGGAGCTGG ACAAGTGGGC CAGCCTGTGG 
1981 AACTGGTTCG ACATCACCAA GTGGCTGTGG TACATCAAGA TCTTCATCAT GATCATCGGC 
2041 GGCCTGATCG GCCTGCGCAT CATCTTCGCC GTGCTGAGCA TCGTGAACCG CGTGCGCCAG 
2101 GGCTACAGCC CCCTGAGCTT CCAGACCCCC TTCCACCACC AGCGCGAGCC CGACCGCAGC 
2161 GAGCGCATCG AGGAGGGCGG CGGCGAGCAG GGCCGCGACC GCAGCGTGCG CCTGGTGAGC 
2221 GGCTTCCTGG CCCTGGCCTG GGACGACCTG CGCAGCCTGT GCCTGTTCAG CTACCACCGC 
2281 CTGCGCGACT TCATCCTGAT CGCCGCCCGC ACCGTGAAGC TGCTGGGCCG CAGCAGCCTG 
2341 AAGGGCCTGC GCCGCGGCTG GGAGGGCCTG AAGTACCTGG GCAACCTGCT GCTGTACTGG 
24 01 GGCCAGGAGC TGAAGATCAG CGCCATCAGC CTGCTGGACG CCACCGCCAT CATCGTGGCC 
24 61 GGCTGGACCG ACCGCGTGAT CGAGGTGGCC CAGGGCGCCT GGCGCGCCAT CCTGCACATC 
2521 CCCCGCCGCA TCCGCCAGGG CCTGGAGCGC ACCCTGCTGT AA 
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1 ATGGATGCAA TGAAGAGAGG GCTCTGCTGT GTGCTGCTGC TGTGTGGAGC AGTCTTCGTT 
61 TCGCCCAGCG CTAGCAACAA CTTGTGGGTT ACAGTTTATT ATGGGGTTCC TGTGTGGAGA 
121 GATGCAGATA CCACCCTATT TTGTGCATCA GATGCCAAAG CACATGAGAC AGAAGTGCAC 
181 AATGTCTGGG CCACACATGC CTGTGTACCC ACAGACCCCA ACCCACAAGA AATACACCTG 
241 GAAAATGTAA CAGAAAATTT TAACATGTGG AAAAATAACA TGGTAGAGCA GATGCAGGAG 
301 GATGTAATCA GTTTATGGGA TCAAAGTCTA AAGCCATGTG TAAAGTTAAC TCCTCTCTGC 
361 GTTACTTTAA ATTGTACCAA TGCTAAGTTG ACCAATGTCA ATAACATAAC CAGTGTCTCT 
421 AACACAATAG GAAATATAAC AGATGAAGTA AGAAACTGTT CTTTTAATAT GACCACAGAA 
481 CTAAGAGATA AGAAGCAGAA GGTCCATGCA CTTTTTTATA AGCTTGATAT AGTACCAATT 
541 GAAGATAATA AGACTAGTAG TGAGTATAGG TTAATAAATT GTAATACTTC AGTCATTAAG 
601 CAGGCTTGTC CAAAGATATC CTTTGATCCA ATTCCTATAC ATTATTGTAC TCCAGCTGGT 
661 TATGCGATTT TAAAGTGTAA TGATAAGAAT TTCAATGGGA CAGGGCCATG TAAAAATGTC 
721 AGCTCAGTAC AATGCACACA TGGAATTAAG CCAGTGGTAT CAACTCAATT GCTGTTAAAT 
781 GGCAGTCTAG CAGAAGAAGA GATAATAATC AGATCTGAAA ATCTCACAAA CAATGCCAAA 
841 ACCATAATAG TGCACCTTAA TAAATCTGTA GAAATCAATT GTACCAGACC CTCCAACAAT 
901 ACAAGAACAA GTATAACTAT AGGACCAGGA CAAGTATTCT ATAGAACAGG AGACATAATA 
961 GGAGATATAA GAAAAGCATA TTGTGAGATT AATGGAACAA AATGGAATGA AGTTTTAACA 
1021 CAGGTAACTG AAAAATTAAA AGAGCACTTT AATAATAAGA CAATAATCTT TCAACCACCC 
1081 TCAGGAGGAG ATCTAGAAAT TACAATGCAT CATTTTAATT GTAGAGGGGA ATTTTTCTAT 
1141 TGCAATACAA CACGACTGTT TAATAATACT TGCATAGAAA ATGGAACCAT GGGGGGGTGT 
1201 AATGGCACTA TCATACTTCC ATGCAAGATA AAGCAAATTA TAAACATGTG GCAGGGAGCA 
1261 GGACAAGCAA TGTATGCTCC TCCCATCAGT GGAAGAATTA ATTGTGTATC AAATATTACA 
1321 GGAATACTAT TGACAAGAGA TGGTGGTGCT ATTAATACAA CTAATGAGAC CTTCCGCCCC 
1381 GGCGGCGGCA ACATCAAGGA CAACTGGCGC AGCGAGCTGT ACAAGTACAA GGTGGTGCAG 
1441 ATCGAGCCCC TGGGCATCGC CCCCACCCGC GCCAAGCGCC GCGTGGTGGA GCGCGAGAAG 
1501 CGCGCCGTGG GCATCGGCGC CATGATCTTC GGCTTCCTGG GCGCCGCCGG CAGCACCATG 
1561 GGCGCCGCCA GCATCACCCT GACCGTGCAG GCCCGCCAGC TGCTGAGCGG CATCGTGCAG 
1621 CAGCAGAGCA ACCTGCTGCG CGCCATCGAG GCCCAGCAGC ACCTGCTGCA GCTGACCGTG 
1681 TGGGGCATCA AGCAGCTGCA GGCCCGCGTG CTGGCCGTGG AGCGCTACCT GAAGGACCAG 
1741 AAGTTCCTGG GCCTGTGGGG CTGCAGCGGC AAGATCATCT GCACCACCGC CGTGCCCTGG 
1801 AACAGCACCT GGAGCAACCG CAGCTACGAG GAGATCTGGA ACAACATGAC CTGGATCGAG 
1861 TGGGAGCGCG AGATCAGCAA CTACACCAAC CAGATCTACG AGATCCTGAC CGAGAGCCAG 
1921 AACCAGCAGG ACCGCAACGA GAAGGACCTG CTGGAGCTGG ACAAGTGGGC CAGCCTGTGG 
1981 AACTGGTTCG ACATCACCAA GTGGCTGTGG TACATCAAAA TATTTATAAT GATAATAGGA 
2 041 GGTTTAATAG GTTTAAGGAT AATTTTTGCT GTGCTTTCTA TAGTGAATAG AGTTAGGCAG 
2101 GGATACTCAC CTTTGTCTTT CCAGACCCCT TTCCATCATC AGAGGGAACC CGACAGATCC 
2161 GAAAGAATCG AAGAAGGAGG TGGCGAGCAA GGCAGAGACA GATCCGTGCG ATTAGTGAGC 
2221 GGATTCTTAG CTCTTGCGTG GGACGATCTA CGGAGCCTGT GCCTCTTCAG CTACCACCGC 
2281 TTGAGAGACT TCATCTTGAT TGCAGCGAGG ACTGTGAAAC TTCTGGGACG CAGCAGTCTC 
2341 AAGGGACTGA GACGGGGGTG GGAAGGTCTC AAATATCTGG GGAATCTTCT GTTATATTGG 
2401 GGTCAGGAAC TAAAAATTAG CGCTATTTCT TTGCTTGATG CTACAGCAAT AATAGTAGCG 
24 61 GGGTGGACAG ATAGGGTTAT AGAAGTAGCA CAAGGAGCTT GGAGAGCCAT TCTCCACATA 
2521 CCTAGGAGAA TCAGACAGGG CTTAGAAAGG ACTTTGCTAT AA 
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